https://hgpu.org/?p=2437
Accelerating linpack with CUDA on heterogenous clusters