Comparing Many-Core Accelerator Frameworks
Institute for Mathematics and Scientific Computing, University of Graz, 8061 Graz, Austria
University of Graz, 2012
@article{haase2012comparing,
title={Comparing Many-Core Accelerator Frameworks},
author={Haase, G. and Kucher, A.},
year={2012}
}
GPUs are already well established as general-purpose processors in scientific and high-performance computing. Their steadily growing success has led vendors other than GPU manufacturers to develop many-core processors as hardware accelerators. Two frameworks, CUDA and OpenCL, are available for GPU programming. Apart from potential compatibility problems with upcoming hardware, both share a common disadvantage: they are hard to program efficiently, and code written with them can be even harder to maintain in existing large applications. PGI Accelerator and HMPP Workbench are two frameworks with an abstract programming model, similar to OpenMP, that allow existing sequential codes to be ported by means of preprocessor directives. Depending on the target architecture, i.e., the hardware accelerator, a code generator uses these directives to generate accelerator code. In this technical report we present these frameworks and evaluate them in terms of performance and applicability. PGI Accelerator and HMPP Workbench turn out to give similar performance results. The code generator of PGI Accelerator can apply a number of optimization strategies automatically, whereas HMPP Workbench is more sophisticated regarding the range of target architectures and its applicability to existing codes.
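The directive-based porting style described in the abstract can be illustrated with a minimal sketch. It uses a standard OpenMP directive only as a stand-in, since the abstract compares the model to OpenMP; it is not PGI Accelerator or HMPP Workbench syntax, and the function name `saxpy` and its parameters are hypothetical choices for this example, not taken from the report.

```c
#include <assert.h>
#include <stddef.h>

/* Computes y[i] = a * x[i] + y[i] for i in [0, n).
 * The loop body is ordinary sequential C. The single directive above
 * the loop is the only change needed to request parallel execution.
 * Assumption: OpenMP is used here purely as an analogy for the
 * accelerator directives discussed in the abstract. */
void saxpy(int n, float a, const float *x, float *y)
{
    assert(n >= 0);
    assert(n == 0 || (x != NULL && y != NULL));

    /* A compiler without OpenMP support ignores this pragma and
     * runs the loop sequentially, producing the same result. */
    #pragma omp parallel for
    for (int i = 0; i < n; ++i) {
        y[i] = a * x[i] + y[i];
    }
}
```

Because the directive is a hint to the compiler and does not rewrite the loop, the same source file still builds and runs as plain sequential code when the directive is not recognized. This is the property that makes such models attractive for porting existing large applications.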
February 15, 2012 by hgpu