https://hgpu.org/?p=13895
A Framework for General Sparse Matrix-Matrix Multiplication on GPUs and Heterogeneous Processors