GPU-Accelerated Asynchronous Error Correction for Mixed Precision Iterative Refinement
Karlsruhe Institute of Technology, Karlsruhe, Germany
University of Tennessee Computer Science Technical Report UT-CS-11-690 (also Lawn 260), 2011
@article{anzt2011gpu,
title={GPU-Accelerated Asynchronous Error Correction for Mixed Precision Iterative Refinement},
author={Anzt, H. and Luszczek, P. and Dongarra234, J. and Heuveline, V.},
year={2011}
}
In hardware-aware high performance computing, block- asynchronous iteration and mixed precision iterative refinement are two techniques that are applied to leverage the computing power of SIMD accelerators like GPUs. Although they use a very different approach for this purpose, they share the basic idea of compensating the convergence behaviour of an inferior numerical algorithm by a more efficient usage of the provided computing power. In this paper, we want to analyze the potential of combining both techniques. Therefore, we implement a mixed precision iterative refinement algorithm using a block-asynchronous iteration as an error correction solver, and compare its performance with a pure implementation of a block-asynchronous iteration and an iterative refinement method using double precision for the error correction solver. For matrices from the University of Florida Matrix collection, we report the convergence behaviour and provide the total solver runtime using different GPU architectures.
January 6, 2012 by hgpu