https://hgpu.org/?p=2181
Concurrent Number Cruncher: An Efficient Sparse Linear Solver on the GPU