Atomic-free Irregular Computations on GPUs
The University of Texas Austin
6th Workshop on General Purpose Processor Using Graphics Processing Units (GPGPU-6), 2013
@inproceedings{nasre2013atomic,
title={Atomic-free irregular computations on GPUs},
author={Nasre, Rupesh and Burtscher, Martin and Pingali, Keshav},
booktitle={Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units},
pages={96–107},
year={2013},
organization={ACM}
}
Atomic instructions are a key ingredient of codes that operate on irregular data structures like trees and graphs. It is well known that atomics can be expensive, especially on massively parallel GPUs, and are often on the critical path of a program. In this paper, we present two high-level methods to eliminate atomics in irregular programs. The first method advocates synchronous processing using barriers. We illustrate how to exploit synchronous processing to avoid atomics even when the threads’ memory accesses conflict with each other. The second method is based on exploiting algebraic properties of algorithms to elide atomics. Specifically, we focus on three key properties: monotonicity, idempotency and associativity, and show how each of them enables an atomic-free implementation. We illustrate the generality of the two methods by applying them to five irregular graph applications: breadth-first search, single-source shortest paths computation, Delaunay mesh refinement, pointer analysis and survey propagation, and show that both methods provide substantial speedup in each case on different GPUs.
April 7, 2013 by hgpu