https://hgpu.org/?p=9474
A Performance Modeling and Optimization Analysis Tool for Sparse Matrix-Vector Multiplication on GPUs