https://hgpu.org/?p=14409
Optimizing strassen matrix multiply on GPUs