Accelerating the Sweep3D for a Graphic Processor Unit
Department of Computer Sciences, National University of Defense Technology, Changsha, P.R. China
The Journal of Information Processing Systems, Volume 7, Number 1 (March 2011), p.63-74
@article{gong2011accelerating,
title={Accelerating the Sweep3D for a Graphic Processor Unit},
author={Gong, C. and Liu, J. and Chen, H. and Xie, J. and Gong, Z.},
journal={The Journal of Information Processing Systems},
volume={7},
issn={1976-913X},
pages={63–74},
year={2011}
}
As a powerful and flexible processor, the Graphic Processing Unit (GPU) can offer a great faculty in solving many high-performance computing applications. Sweep3D, which simulates a single group time-independent discrete ordinates (Sn) neutron transport deterministically on 3D Cartesian geometry space, represents the key part of a real ASCI application. The wavefront process for parallel computation in Sweep3D limits the concurrent threads on the GPU. In this paper, we present multi-dimensional optimization methods for Sweep3D, which can be efficiently implemented on the finegrained parallel architecture of the GPU. Our results show that the overall performance of Sweep3D on the CPU-GPU hybrid platform can be improved up to 4.38 times as compared to the CPU-based implementation.
May 25, 2011 by hgpu