https://hgpu.org/?p=10758
Architecture- and Workload-Aware Heterogeneous Algorithms for Sparse Matrix Vector Multiplication