high performance computing on graphics processing units: hgpu.org

The Stencil Processing Unit: GPGPU Done Right

Sanjay Rajopadhye, Guillaume Iooss, Tomofumi Yuki, Dan Connors

Colorado State University

Colorado State University Technical Report CS-13-103, 2013

@article{rajopadhye2013stencil,
  title={The Stencil Processing Unit: GPGPU Done Right},
  author={Rajopadhye, Sanjay and Iooss, Guillaume and Yuki, Tomofumi and Connors, Dan},
  year={2013}
}

As computing moves to exascale, energy efficiency will become the dominant concern. We propose a new GPU-like accelerator, the Stencil Processing Unit (SPU), for implementing dense stencil computations in an energy-efficient manner. We address all levels of the programming stack: architecture, programming API, runtime system, and compilation. First, a simple architectural innovation to current GPU architectures gives SPUs inter-processor communication between the coarse-grain processors (SMs or TPs). Despite this simplicity, the mere possibility of on-chip communication raises many challenges and makes programming even more difficult than it currently is. We therefore address the programming challenge by limiting access to the communication through a disciplined API and a mechanism that can be statically checked. This allows us to propose simple modifications to existing GPU runtime systems to manage the execution of the new API on the SPU architecture. Based on our analytical models, we expect an order-of-magnitude reduction in energy cost when stencil codes are implemented on the proposed architecture.
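
For readers unfamiliar with the term, the sketch below illustrates what a dense stencil computation looks like and why tiling it creates a need for communication between neighboring tiles. It is not taken from the report and does not represent the SPU API: the function names `jacobi_step` and `jacobi_step_tiled`, the 3-point averaging stencil, and the tile size are all assumptions chosen for illustration. The "halo" cells read by each tile are the kind of boundary data that, presumably, on-chip inter-processor communication would carry.

```python
def jacobi_step(u):
    """One sweep of a 3-point averaging stencil; the two boundary values stay fixed."""
    n = len(u)
    out = list(u)
    for i in range(1, n - 1):
        out[i] = (u[i - 1] + u[i] + u[i + 1]) / 3.0
    return out


def jacobi_step_tiled(u, tile):
    """Same sweep, computed tile by tile (hypothetical tiling, for illustration only)."""
    n = len(u)
    out = list(u)
    for start in range(1, n - 1, tile):
        end = min(start + tile, n - 1)
        # Halo: each tile also reads one cell owned by each neighboring tile.
        local = u[start - 1:end + 1]
        for j in range(1, len(local) - 1):
            out[start + j - 1] = (local[j - 1] + local[j] + local[j + 1]) / 3.0
    return out


if __name__ == "__main__":
    grid = [0.0, 3.0, 0.0, 6.0, 0.0, 9.0, 0.0]
    # Tiled and untiled sweeps perform the same arithmetic, so they agree exactly.
    print(jacobi_step(grid) == jacobi_step_tiled(grid, 2))
```

Each tile is independent within a sweep, but between sweeps a tile needs the updated edge values of its neighbors. How cheaply that exchange can be done is the cost the abstract's on-chip communication appears aimed at.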

Tags: Computer science, Energy-efficient computing

April 3, 2013 by hgpu
