https://hgpu.org/?p=5693
From CUDA to OpenCL: Towards a Performance-portable Solution for Multi-platform GPU Programming