https://hgpu.org/?p=2869
An experimental study on performance portability of OpenCL kernels