https://hgpu.org/?p=18268
Optimizing Sparse Matrix-Vector Multiplication on Emerging Many-Core Architectures