Auto-Tuning CUDA Parameters for Sparse Matrix-Vector Multiplication on GPUs
Department of Computer Science, University of Wyoming, USA
International Conference on Computational and Information Sciences, 2010, pp. 1154-1157
@conference{guo2010auto,
title={Auto-Tuning CUDA Parameters for Sparse Matrix-Vector Multiplication on GPUs},
author={Guo, P. and Wang, L.},
booktitle={2010 International Conference on Computational and Information Sciences},
pages={1154–1157},
year={2010},
organization={IEEE}
}
Graphics Processing Unit (GPU) has become an attractive coprocessor for scientific computing due to its massive processing capability. The sparse matrix-vector multiplication (SpMV) is a critical operation in a wide variety of scientific and engineering applications, such as sparse linear algebra and image processing. This paper presents an auto-tuning framework that can automatically compute and select CUDA parameters for SpMV to obtain the optimal performance on specific GPUs. The framework is evaluated on two NVIDIA GPU platforms, GeForce 9500 GTX and GeForce GTX 295.
April 21, 2011 by hgpu