Optimization strategies for parallel CPU and GPU implementations of a meshfree particle method
EPHYSLAB Environmental Physics Laboratory, Universidade de Vigo, Campus As Lagoas s/n, 32004, Ourense, Spain
arXiv:1110.3711v1 [cs.PF] (17 Oct 2011)
@article{2011arXiv1110.3711D,
author={Dom{‘{i}}nguez}, J.~M. and {Crespo}, A.~J.~C. and {G{‘o}mez-Gesteira}, M.},
title={"{Optimization strategies for parallel CPU and GPU implementations of a meshfree particle method}"},
journal={ArXiv e-prints},
archivePrefix={"arXiv"},
eprint={1110.3711},
primaryClass={"cs.PF"},
keywords={Computer Science – Performance, Computer Science – Computational Engineering, Finance, and Science, 68Uxx},
year={2011},
month={oct},
adsurl={http://adsabs.harvard.edu/abs/2011arXiv1110.3711D},
adsnote={Provided by the SAO/NASA Astrophysics Data System}
}
Much of the current focus in high performance computing (HPC) for computational fluid dynamics (CFD) deals with grid based methods. However, parallel implementations for new meshfree particle methods such as Smoothed Particle Hydrodynamics (SPH) are less studied. In this work, we present optimizations for both central processing unit (CPU) and graphics processing unit (GPU) of a SPH method. These optimization strategies can be further applied to many other meshfree methods. The obtained performance for each architecture and a comparison between the most efficient implementations for CPU and GPU are shown.
October 18, 2011 by hgpu