Cache and bandwidth aware matrix multiplication on the GPU
University of Illinois
(2003)
@article{hall2003cache,
title={Cache and bandwidth aware matrix multiplication on the {GPU}},
author={Hall, J.D. and Carr, N.A. and Hart, J.C.},
year={2003},
publisher={Citeseer}
}
Recent advances in the speed and programmability of consumer-level graphics hardware have sparked a flurry of research that goes beyond the realm of image synthesis and computer graphics. We examine the use of the GPU (graphics processing unit) as a tool for scientific computing, by analyzing techniques for performing large matrix multiplies in GPU hardware. An earlier method for multiplying matrices on the GPU suffered from problems of memory bandwidth. This paper examines more efficient…
October 27, 2010 by hgpu