A uniform approach for programming distributed heterogeneous computing systems
Institute of Computer Science, University of Innsbruck, Austria
Journal of Parallel and Distributed Computing, 2014
@article{grasso2014uniform,
title={A uniform approach for programming distributed heterogeneous computing systems},
author={Grasso, Ivan and Pellegrini, Simone and Cosenza, Biagio and Fahringer, Thomas},
journal={Journal of Parallel and Distributed Computing},
year={2014},
publisher={Elsevier}
}
Large-scale compute clusters of heterogeneous nodes equipped with multi-core CPUs and GPUs are getting increasingly popular in the scientific community. However, such systems require a combination of different programming paradigms making application development very challenging. In this article we introduce libWater, a library-based extension of the OpenCL programming model that simplifies the development of heterogeneous distributed applications. libWater consists of a simple interface, which is a transparent abstraction of the underlying distributed architecture, offering advanced features such as inter-context and inter-node device synchronization. It provides a runtime system which tracks dependency information enforced by event synchronization to dynamically build a DAG of commands, on which we automatically apply two optimizations: collective communication pattern detection and device-host-device copy removal. We assess libWater’s performance in three compute clusters available from the Vienna Scientific Cluster, the Barcelona Supercomputing Center and the University of Innsbruck, demonstrating improved performance and scaling with different test applications and configurations.
September 4, 2014 by hgpu