https://hgpu.org/?p=7830
The Fat-Link Computation On Large GPU Clusters for Lattice QCD