Effective GPU Strategies for LU Decomposition

Dinesh Bandara, Nalin Ranasinghe
University of Colombo School of Computing, Sri Lanka
HiPC 2011 Student Research Symposium, 2011


   title={Effective GPU Strategies for LU Decomposition},

   author={Bandara, H. and Ranasinghe, DN},



Download Download (PDF)   View View   Source Source   



GPUs are becoming an attractive computing platform not only for traditional graphics computation but also for general-purpose computation because of the computational power, programmability and comparatively low cost of modern GPUs. This has lead to a variety of complex GPGPU applications with significant performance improvements. The LU decomposition represents a fundamental step in many computationally intensive scientific applications and it is often the costly step in the solution process because of the impact of size of the matrix. In this paper we implement three different variants of the LU decomposition algorithm on a Tesla C1060 and the most significant LU decomposition that fits the highly parallel architecture of modern GPUs is found to be Update through Column with shared memory access implementation.
No votes yet.
Please wait...

* * *

* * *

* * *

HGPU group © 2010-2022 hgpu.org

All rights belong to the respective authors

Contact us: