Overlapping Computation and Communication for Advection on Hybrid Parallel Computers
National Center for Atmospheric Research
IEEE International Parallel & Distributed Processing Symposium (IPDPS), 2011
@inproceedings{white2011overlapping,
title={Overlapping Computation and Communication for Advection on Hybrid Parallel Computers},
author={White III, J.B. and Dongarra, JJ},
booktitle={Parallel & Distributed Processing Symposium (IPDPS), 2011 IEEE International},
pages={59–67},
year={2011},
organization={IEEE}
}
We describe computational experiments exploring the performance improvements from overlapping computation and communication on hybrid parallel computers. Our test case is explicit time integration of linear advection with constant uniform velocity in a three-dimensional periodic domain. The test systems include a Cray XT5, a Cray XE6, and two multicore Infiniband clusters with different generations of NVIDIA graphics processing units (GPUs). We describe results for Fortran implementations using various combinations of MPI, OpenMP, and CUDA, with and without overlap of computation and communication. We find that overlapping CPU computation, GPU computation, parallel communication, and CPU-GPU communication can provide performance improvements of more than a factor of two.
November 27, 2011 by hgpu