Mixing Multi-Core CPUs and GPUs for Scientific Simulation Software
Institute of Information and Mathematical Sciences, Massey University, Albany, North Shore 102-904, Auckland, New Zealand
Technical Report CSTN-091, Massey University, 2009
@techreport{hawick2009mixing,
title={Mixing Multi-Core CPUs and GPUs for Scientific Simulation Software},
author={Hawick, KA and Leist, A. and Playne, DP},
year={2009},
institution={Technical Report CSTN-091, Computer Science, Massey University}
}
Recent technological and economic developments have led to widespread availability of multi-core CPUs and specialist accelerator processors such as graphical processing units (GPUs). The accelerated computational performance possible from these devices can be very high for some applications paradigms. Software languages and systems such as NVIDIA’s CUDA and Khronos consortium’s open compute language (OpenCL) support a number of individual parallel application programming paradigms. To scale up the performance of some complex systems simulations however, a hybrid of multi-core CPUs for coarse-grained parallelism and very many core GPUs for data parallelism is necessary. We describe our use of hybrid applications using threading approaches and multi-core CPUs to control independent GPU devices to speed up scientific simulations. We present speed-up data and discuss multi-threading software issues for the applications level programmer and offer some suggested areas for language development and integration between coarse-grained and fine grained multi-thread systems. We discuss results from three common simulation algorithmic areas including: partial differential equations; graph cluster metric calculations and random number generation. We report on programming experiences and selected performance for these algorithms on: CUDA programmed single and multiple GPUs; on multi-core CPUs; on a CellBE; and using OpenCL. We discuss programmer usability issues and the outlook and trends in multi-core programming for scientific applications developers.
February 14, 2011 by hgpu