https://hgpu.org/?p=12082
Fine-Grained Parallel Incomplete LU Factorization