https://hgpu.org/?p=15337
Task Parallel Incomplete Cholesky Factorization using 2D Partitioned-Block Layout