The manual Fortran OpenCL implementation of NemoLite2D sometimes produces checksum values of 0. This may be related to invalid memory accesses, since the code sometimes reads out-of-bounds values.
The problem does not appear to be tied to a single OpenCL device: it has been observed with both POCL and AMD GPUs.