The Fat-Link Computation On Large GPU Clusters for Lattice QCD

Guochun Shi, Ronald Babich, Michael A. Clark, Balint Joo, Steven Gottlieb, Volodymyr Kindratenko
National Center for Supercomputing Applications (NCSA), University of Illinois, Urbana, IL 61801, USA
Symposium on Application Accelerators in High-Performance Computing (SAAHPC’12), 2012


   title={The Fat-Link Computation On Large GPU Clusters for Lattice QCD},

   author={Shi, Guochun and Babich, Ronald and Clark, Michael A. and Joo, Balint and Gottlieb, Steven and Kindratenko, Volodymyr},



Download Download (PDF)   View View   Source Source   



Graphics Processing Units (GPU) are becoming increasingly popular in high performance computing due to their high performance, high power ef?ciency and low cost. In this paper, we present results of an effort to implement the fatlink computation – an important component of many lattice quantum chromodynamics (LQCD) calculations – on GPU clusters using the QUDA framework. Two implementations, one similar to the original CPU algorithm in the MILC code and one based on the idea of reduced communication by redundant computations, are presented and their relative advantages are discussed. In strong-scaling tests on up to 384 GPUs on Longhorn and 256 GPUs on Keeneland GPU clusters, where the CPU core to GPU ratio is 4:1 in both clusters, we achieved up to 11.4x and 8.7x node speedup when running on the two GPU clusters, respectively.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: