Optimizing dataflow applications on heterogeneous environments
Dept. of Computer Science, Universidade Federal de Minas Gerais, Brazil
Cluster Computing, Volume 15, Number 2, 125-144, 2012
@article{teodoro2012optimizing,
title={Optimizing dataflow applications on heterogeneous environments},
author={Teodoro, G. and Hartley, T.D.R. and Catalyurek, U.V. and Ferreira, R.},
journal={Cluster Computing},
volume={15},
number={2},
pages={125},
year={2012}
}
The increases in multi-core processor parallelism and in the flexibility of many-core accelerator processors, such as GPUs, have turned traditional SMP systems into hierarchical, heterogeneous computing environments. Fully exploiting these improvements in parallel system design remains an open problem. Moreover, most of the current tools for the development of parallel applications for hierarchical systems concentrate on the use of only a single processor type (e.g., accelerators) and do not coordinate several heterogeneous processors. Here, we show that making use of all of the heterogeneous computing resources can significantly improve application performance. Our approach, which consists of optimizing applications at run-time by efficiently coordinating application task execution on all available processing units is evaluated in the context of replicated dataflow applications. The proposed techniques were developed and implemented in an integrated run-time system targeting both intra- and inter-node parallelism. The experimental results with a real-world complex biomedical application show that our approach nearly doubles the performance of the GPU-only implementation on a distributed heterogeneous accelerator cluster.
June 19, 2012 by hgpu