high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Optimizing GPU to GPU Communication on Cray XK7

Optimizing GPU to GPU Communication on Cray XK7

Jeff M. Larkin

NVIDIA, Santa Clara, CA, USA

A New Vintage of Computing (CUG2013), 2013

@article{larkin2013optimizing,

title={Optimizing GPU to GPU Communication on Cray XK7},

author={Larkin, Jeff M},

year={2013}

}

Download (PDF)

View

Source

1989

views

When developing an application for Cray XK7 systems, optimization of compute kernels is only a small part of maximizing scaling and performance. Programmers must consider the effect of the GPU’s distinct address space and the PCIe bus on application scalability. Without such considerations applications rapidly become limited by transfers to and from the GPU and fail to scale to large numbers of nodes. This paper will demonstrate methods for optimizing GPU to GPU communication and present XK7 results for these methods.

Tags: Computer science, CUDA, MPI, nVidia, OpenACC, Tesla K20

December 19, 2013 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

high performance computing on graphics processing units: hgpu.org

Optimizing GPU to GPU Communication on Cray XK7

Your response

Recent source codes

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

Probe-and-Refine Tuning of Repository Guidance for AI Coding Agents

CUDAnalyst (CUDA + Analyst)

CodegenBench

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CUDA Kernel Fusion Benchmarks

IntelliKit: Agent-first tooling for AMD hardware

DITRON: Distributed Compiler based on Triton for Parallel Systems

CuTile Benchmark Suite: Performance and Productivity Tradeoffs for GPU Kernel Programming on Blackwell Architecture

Agentic Code Optimization via Compiler-LLM Cooperation

Most viewed papers (last 30 days)

Optimizing GPU to GPU Communication on Cray XK7

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)