Frameworks for multi-core architectures: a comprehensive evaluation using 2D/3D image registration
Hardware/Software Co-Design, Department of Computer Science, University of Erlangen-Nuremberg, Germany
Architecture of Computing Systems – ARCS 2011, Lecture Notes in Computer Science, 2011, Volume 6566/2011, 62-73
@article{membarth2011frameworks,
title={Frameworks for multi-core architectures: a comprehensive evaluation using 2D/3D image registration},
author={Membarth, R. and Hannig, F. and Teich, J. and K{\"o}rner, M. and Eckert, W.},
journal={Architecture of Computing Systems-ARCS 2011},
pages={62–73},
year={2011},
publisher={Springer}
}
The development of standard processors changed in the last years moving from bigger, more complex, and faster cores to putting several more simple cores onto one chip. This changed also the way programs are written in order to leverage the processing power of multiple cores of the same processor. In the beginning, programmers had to divide and distribute the work by hand to the available cores and to manage threads in order to use more than one core. Today, several frameworks exist to relieve the programmer from such tasks. In this paper, we present five such frameworks for parallelization on shared memory multi-core architectures, namely OpenMP, Cilk++, Threading Building Blocks, RapidMind, and OpenCL. To evaluate these frameworks, a real world application from medical imaging is investigated, the 2D/3D image registration. In an empirical study, a fine-grained data parallel and a coarse-grained task parallel parallelization approach are used to evaluate and estimate different aspects like usability, performance, and overhead of each framework.
August 19, 2011 by hgpu