https://hgpu.org/?p=3868
Gemma in April: A matrix-like parallel programming architecture on OpenCL