https://hgpu.org/?p=4344
FPGA Based High Performance and Scalable Block LU Decomposition Architecture