945

Cache and bandwidth aware matrix multiplication on the GPU

J. Hall, N. Carr, J. Hart
University of Illinois
(2003)
BibTeX

Download Download (PDF)   View View   Source Source   

1778

views

Recent advances in the speed and programmability of consumer level graphics hardware has sparked a flurry of research that goes beyond the realm of image synthesis and computer graphics. We examine the use of the GPU (graphics processing unit) as a tool for scientific computing, by analyzing techniques for performing large matrix multiplies in GPU hardware. An earlier method for multiplying matrices on the GPU su#ered from problems of memory bandwidth. This paper examines more e#cient…
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org