https://hgpu.org/?p=5689
Optimizing OpenCL Kernels for Iterative Statistical Applications on GPUs