https://hgpu.org/?p=9129
Optimizing Sparse Matrix-Matrix Multiplication for the GPU