A very simple GPU kernel (written in PVL) that adds the content of two matrices and stores it in an "output" array (in GPU programming fashion). This program is currently disabled due to refactoring of the Chalice back-end.