https://hgpu.org/?p=15976
Adaptive Multi-level Blocking Optimization for Sparse Matrix Vector Multiplication on GPU