https://hgpu.org/?p=1388
Improving many flavor QCD simulations using multiple GPUs