https://hgpu.org/?p=5718
Optimizing Linpack Benchmark on GPU-Accelerated Petascale Supercomputer