high performance computing on graphics processing units: hgpu.org

Improving the Efficiency of GPU Clusters

Bill Mcmillan

Platform Computing

HPC Day (High Performance Computing Day), Stanford University, 2010

@misc{mcmillan2010improving,
  title={Improving the Efficiency of GPU Clusters},
  author={Mcmillan, Bill},
  year={2010}
}

If you perceive more than a little excitement around the topic of Graphics Processing Units (GPUs) in High-Performance Computing (HPC), it’s for good reason. HPC is all about performance, after all, and it’s not every day that a new technology promises an order-of-magnitude boost in processing power. A variety of new GPU processing elements are rapidly finding their way into HPC clusters as developers scramble to rework algorithms to take maximum advantage of these new capabilities. The industry is taking note not only of the performance; the economics are compelling too. In data centers where the costs of facilities, power and cooling dominate the cost of the compute infrastructure itself, achieving better density and performance-to-power ratios warrants serious attention. As we all know, however, achieving real-world performance is about much more than the raw performance of the underlying hardware. Just as a highly efficient power plant makes little sense when connected to a distribution network that loses 70% of its power in transmission, the same applies to HPC clusters: efficiency matters. While many factors impact efficiency, this paper focuses on the critical role of scheduling and workload management in getting the most out of your GPU cluster. By “working smarter” and enabling GPU clusters with dramatically higher utilization and throughput, organizations can not only achieve savings in infrastructure and management costs but also boost productivity.

Tags: Computer science, GPU cluster, Review

March 12, 2011 by hgpu
