https://hgpu.org/?p=16703
Shuffle Reduction Based Sparse Matrix-Vector Multiplication on Kepler GPU