https://hgpu.org/?p=18114
Sparse Matrix-Matrix Multiplication on Multilevel Memory Architectures : Algorithms and Experiments