https://hgpu.org/?p=9354
GPU Sparse Matrix Multiplication with CUDA