Optimizing GPU to GPU Communication on Cray XK7

Jeff M. Larkin
NVIDIA, Santa Clara, CA, USA
A New Vintage of Computing (CUG2013), 2013


   title={Optimizing GPU to GPU Communication on Cray XK7},

   author={Larkin, Jeff M},



Download Download (PDF)   View View   Source Source   



When developing an application for Cray XK7 systems, optimization of compute kernels is only a small part of maximizing scaling and performance. Programmers must consider the effect of the GPU’s distinct address space and the PCIe bus on application scalability. Without such considerations applications rapidly become limited by transfers to and from the GPU and fail to scale to large numbers of nodes. This paper will demonstrate methods for optimizing GPU to GPU communication and present XK7 results for these methods.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: