https://hgpu.org/?p=8775
Implementing Sparse Matrix-Vector Multiplication with QCSR on GPU