https://hgpu.org/?p=13751
Fast Radix Sort for Sparse Linear Algebra on GPU