https://hgpu.org/?p=5576
Scaling Lattice QCD beyond 100 GPUs