GPU-accelerated WZ Factorization with the Use of the CUBLAS Library

Beata Bylina, Jaroslaw Bylina
Institute of Mathematics, Marie Curie-Sklodowska University, Pl. M. Curie-Sklodowskiej 5, 20-031 Lublin, Poland
Preprints of the Federated Conference on Computer Science and Information Systems pp. 537-543, 2012


   title={GPU-accelerated WZ Factorization with the Use of the CUBLAS Library},

   author={Bylina, Beata and Bylina, Jaroslaw},



Download Download (PDF)   View View   Source Source   



We present a novel implementation of a dense, square, non-structured matrix factorization algorithm, namely the WZ factorization – with the use of graphics processors (GPUs) and CPUs to gain a high performance at a low cost. We rewrite this factorization as operations on blocks of matrices and vectors. We have implemented our block-vector algorithm on GPUs with the use of an appropriate (and ready-to-use) GPU-accelerated mathematical library, namely the CUBLAS library. We compared the performance of our algorithm with CPU implementations. In particular, our implementation on an NVIDIA Tesla C2050 GPU outperforms a CPU-based implementation. Our results show that the algorithm scales well with the size of matrices; moreover, the larger the matrix, the better the performance. We also discuss the impact of the size of the matrix and the use of ready-to-use mathematical libraries on the numerical accuracy.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: