https://hgpu.org/?p=6862
A Highly Efficient GPU-CPU Hybrid Parallel Implementation of Sparse LU Factorization