Cache and bandwidth aware matrix multiplication on the GPU

J. Hall, N. Carr, J. Hart
University of Illinois


   title={Cache and bandwidth aware matrix multiplication on the GPU},

   author={Hall, J.D. and Carr, N.A. and Hart, J.C.},







Recent advances in the speed and programmability of consumer-level graphics hardware have sparked a flurry of research that goes beyond the realm of image synthesis and computer graphics. We examine the use of the GPU (graphics processing unit) as a tool for scientific computing by analyzing techniques for performing large matrix multiplies in GPU hardware. An earlier method for multiplying matrices on the GPU suffered from problems of memory bandwidth. This paper examines more efficient…
