https://hgpu.org/?p=7479
An Automatic OpenCL Compute Kernel Generator for Basic Linear Algebra Operations