https://hgpu.org/?p=1622
Fast matrix multiplies using graphics hardware