A note on the GPU acceleration of eigenvalue computations
Argonne National Laboratory, 9700 South Cass Avenue, Argonne, IL 60439, USA
International Conference of Numerical Analysis and Applied Mathematics (ICNAAM), 2013
@article{rupp2013note,
title={A note on the GPU acceleration of eigenvalue computations},
author={Rupp, K and Tillet, Ph and Smith, BF and Grasser, T and J{\"u}ngel, A},
year={2013}
}
Eigenvalue computations for large sparse matrices are commonly based on Krylov subspace techniques such as the Lanczos method. One of the dominant operations in such algorithms is the repeated computation of inner products with the same vector in order to preserve orthogonality of the Krylov basis. These operations can be accelerated on GPUs using existing BLAS functionality. However, this approach is not fully efficient because it incurs unnecessary memory transfers. We present improved implementations in CUDA and OpenCL, which are now available in ViennaCL, PETSc and SLEPc, and demonstrate an up to two-fold performance gain over existing GPU vendor libraries.
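To make the memory-transfer argument concrete, here is a minimal sketch in plain Python. It is not the authors' CUDA or OpenCL code. It assumes that the unnecessary transfers come from re-reading the shared vector once per inner product, and that the remedy is to compute all inner products in a single sweep. The function names `separate_dots` and `fused_dots` are hypothetical and do not correspond to any ViennaCL, PETSc, or SLEPc API. The counter `x_reads` only tallies how often an entry of the shared vector is loaded.

```python
# Illustrative sketch only: plain Python standing in for a GPU kernel.
# Function names are hypothetical, not part of ViennaCL, PETSc, or SLEPc.


def separate_dots(x, ys):
    """One dot product per basis vector, as with repeated BLAS dot calls.

    The shared vector x is traversed once for every vector in ys.
    Returns the inner products and the number of loads from x.
    """
    results = []
    x_reads = 0
    for y in ys:
        acc = 0.0
        for i in range(len(x)):
            acc += x[i] * y[i]
            x_reads += 1  # x[i] is loaded again for every y
        results.append(acc)
    return results, x_reads


def fused_dots(x, ys):
    """All inner products in a single sweep over x.

    Each entry of x is loaded once and reused for every vector in ys.
    Returns the inner products and the number of loads from x.
    """
    results = [0.0] * len(ys)
    x_reads = 0
    for i in range(len(x)):
        xi = x[i]
        x_reads += 1  # x[i] is loaded only once
        for k, y in enumerate(ys):
            results[k] += xi * y[i]
    return results, x_reads


if __name__ == "__main__":
    x = [1.0, 2.0, 3.0, 4.0]
    ys = [
        [1.0, 0.0, 0.0, 0.0],
        [1.0, 1.0, 1.0, 1.0],
        [0.5, 0.5, 0.5, 0.5],
    ]
    print(separate_dots(x, ys))
    print(fused_dots(x, ys))
```

Both functions return the same inner products, but for k basis vectors of length n the separate version loads entries of the shared vector k·n times while the fused version loads them n times. On a GPU, where inner products are typically limited by memory bandwidth, a saving of this kind would be consistent with the reported gain of up to a factor of two, although the actual kernels may differ in detail.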
August 1, 2013 by hgpu