CUDA-Lite: Reducing GPU programming complexity

Sain-Zee Ueng, Melvin Lathara, Sara S. Baghsorkhi, Wen-mei W. Hwu

Center for Reliable and High-Performance Computing, Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign

In: LCPC’08. Volume 5335 of LNCS (2008), pp. 1-15.

DOI:10.1007/978-3-540-89740-8_1

@article{ueng2008cuda,
  title={CUDA-lite: Reducing GPU programming complexity},
  author={Ueng, S.Z. and Lathara, M. and Baghsorkhi, S. and Hwu, W.},
  journal={Languages and Compilers for Parallel Computing},
  pages={1--15},
  year={2008},
  publisher={Springer}
}

Abstract. The computer industry has transitioned into multi-core and many-core parallel systems. The CUDA programming environment from NVIDIA is an attempt to make programming many-core GPUs more accessible to programmers. However, there are still many burdens placed upon the programmer to maximize performance when using CUDA. One such burden is dealing with the complex memory hierarchy. Efficient and correct usage of the various memories is essential, making a difference of 2-17x in performance. Currently, the task of determining the appropriate memory to use and the coding of data transfer between memories is still left to the programmer. We believe that this task can be better performed by automated tools. We present CUDA-lite, an enhancement to CUDA, as one such tool. We leverage programmer knowledge via annotations to perform transformations and show preliminary results that indicate auto-generated code can have performance comparable to hand coding.

Tags: Computer science, CUDA, nVidia, nVidia GeForce 8800 GTX, Programming techniques

November 1, 2010 by hgpu
