https://hgpu.org/?p=9569
FastSpMM: An Efficient Library for Sparse Matrix Matrix Product on GPUs