https://hgpu.org/?p=14703
Comparison of Thread Execution Methods for GPU-oriented OpenCL Programs on Multicore Processors