https://hgpu.org/?p=15544
Enhancing productivity and performance portability of OpenCL applications on heterogeneous systems using runtime optimizations