https://hgpu.org/?p=3407
Exploiting SIMD extensions for linear image processing with OpenCL