high performance computing on graphics processing units: hgpu.org

Overlapping Computation and Communication for Advection on Hybrid Parallel Computers

James B White III (Trey), Jack Dongarra

National Center for Atmospheric Research

IEEE International Parallel & Distributed Processing Symposium (IPDPS), 2011

DOI:10.1109/IPDPS.2011.16

We describe computational experiments exploring the performance improvements from overlapping computation and communication on hybrid parallel computers. Our test case is explicit time integration of linear advection with constant uniform velocity in a three-dimensional periodic domain. The test systems include a Cray XT5, a Cray XE6, and two multicore InfiniBand clusters with different generations of NVIDIA graphics processing units (GPUs). We describe results for Fortran implementations using various combinations of MPI, OpenMP, and CUDA, with and without overlap of computation and communication. We find that overlapping CPU computation, GPU computation, parallel communication, and CPU-GPU communication can provide performance improvements of more than a factor of two.
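
The overlap described in the abstract depends on splitting each time step into work that needs neighbor data and work that does not. The following is a minimal sketch of that split, not the authors' implementation: it assumes a first-order upwind scheme on a one-dimensional periodic grid (the abstract does not name the scheme, and the paper's domain is three-dimensional), uses plain Python lists in place of MPI ranks, and the function names `upwind_step` and `split_step` are hypothetical.

```python
def upwind_step(u, c):
    """One first-order upwind step on a periodic 1D grid.

    c = velocity * dt / dx (assumed 0 <= c <= 1). u[-1] wraps around,
    which gives the periodic boundary for free.
    """
    return [u[i] - c * (u[i] - u[i - 1]) for i in range(len(u))]


def split_step(u, c, nranks):
    """Same step with the grid cut into nranks chunks (stand-ins for ranks).

    Each chunk needs exactly one halo value from its left neighbor.
    """
    n = len(u)
    assert n % nranks == 0, "grid must divide evenly among ranks"
    m = n // nranks
    chunks = [u[r * m:(r + 1) * m] for r in range(nranks)]
    new = []
    for r in range(nranks):
        local = chunks[r]
        # Phase 1: "post" the halo exchange. A real code would start a
        # nonblocking receive here; this sketch only notes the source.
        left_neighbor = chunks[(r - 1) % nranks]
        # Phase 2: interior points depend only on local data, so they can
        # be updated while the exchange is still in flight.
        interior = [local[i] - c * (local[i] - local[i - 1])
                    for i in range(1, m)]
        # Phase 3: "wait" for the halo, then update the boundary point.
        halo = left_neighbor[-1]
        boundary = local[0] - c * (local[0] - halo)
        new.extend([boundary] + interior)
    return new


if __name__ == "__main__":
    u0 = [0.0, 0.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0]
    # The split update reproduces the monolithic update exactly.
    print(split_step(u0, 0.5, 4) == upwind_step(u0, 0.5))
```

Because the interior update in phase 2 never reads the halo, it is the part that can run concurrently with communication; only the boundary update has to wait. In a hybrid setting the same split could, for example, place the interior on a GPU while the CPU handles the exchange, though how the paper divides the work is not stated in the abstract.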

Tags: CUDA, Fluid dynamics, Fortran, GPU cluster, Hybrid computing, MPI, nVidia, OpenMP, OpenMPI, Presentation, Tesla C1060, Tesla C2050

November 27, 2011 by hgpu
