https://hgpu.org/?p=6334
Autotuning GEMMs for Fermi