https://hgpu.org/?p=6878
LU Factorization for Accelerator-based Systems