Parallel Programming Models for Dense Linear Algebra on Heterogeneous Systems
University of Manchester, Manchester, UK
Supercomputing frontiers and innovations, Vol. 2, No. 4, pp. 67-86, 2015
@article{abalenkovs2015parallel,
title={Parallel Programming Models for Dense Linear Algebra on Heterogeneous Systems},
author={Abalenkovs, M and Abdelfattah, A and Dongarra, J and Gates, M and Haidar, A and Kurzak, J and Luszczek, P and Tomov, S and Yamazaki, I and YarKhan, A},
year={2015}
}
We present a review of the current best practices in parallel programming models for dense linear algebra (DLA) on heterogeneous architectures. We consider multicore CPUs, stand alone manycore coprocessors, GPUs, and combinations of these. Of interest is the evolution of the programming models for DLA libraries – in particular, the evolution from the popular LAPACK and ScaLAPACK libraries to their modernized counterparts PLASMA (for multicore CPUs) and MAGMA (for heterogeneous architectures), as well as other programming models and libraries. Besides providing insights into the programming techniques of the libraries considered, we outline our view of the current strengths and weaknesses of their programming models – especially in regards to hardware trends and ease of programming high-performance numerical software that current applications need – in order to motivate work and future directions for the next generation of parallel programming models for high-performance linear algebra libraries on heterogeneous systems.
April 19, 2016 by hgpu