Molecular dynamics simulations with many-body potentials on multiple GPUs – the implementation, package and performance

hgpu.org » Programming » CUDA » Molecular dynamics simulations with many-body potentials on multiple GPUs – the implementation, package and performance

Molecular dynamics simulations with many-body potentials on multiple GPUs – the implementation, package and performance

Qing Hou, Min Li, Yulu Zhou, Jiechao Cui, Zhenguo Cui, Jun Wang

Key Lab of Radiation Physics and Technology, Institute of Nuclear Science and Technology, Sichuan University, Chengdu 610064, China

arXiv:1212.6332 [physics.comp-ph], (27 Dec 2012)

@article{2012arXiv1212.6332H,

author={Hou}, Q. and {Li}, M. and {Zhou}, Y. and {Cui}, J. and {Cui}, Z. and {Wang}, J.},

title={"{Molecular dynamics simulations with many-body potentials on multiple GPUs – the implementation, package and performance}"},

journal={ArXiv e-prints},

archivePrefix={"arXiv"},

eprint={1212.6332},

primaryClass={"physics.comp-ph"},

keywords={Physics – Computational Physics},

year={2012},

month={dec},

adsurl={http://adsabs.harvard.edu/abs/2012arXiv1212.6332H},

adsnote={Provided by the SAO/NASA Astrophysics Data System}

}

Download (PDF)

View

Source

2394

views

Molecular dynamics (MD) is an important research tool extensively applied in materials science. Running MD on a graphics processing unit (GPU) is an attractive new approach for accelerating MD simulations. Currently, GPU implementations of MD usually run in a one-host-process-one-GPU (OHPOG) scheme. This scheme may pose a limitation on the system size that an implementation can handle due to the small device memory relative to the host memory. In this paper, we present a one-host-process-multiple-GPU (OHPMG) implementation of MD with embedded-atom-model or semi-empirical tight-binding many-body potentials. Because more device memory is available in an OHPMG process, the system size that can be handled is increased to a few million or more atoms. In comparison with the CPU implementation, in which Newton’s third law is applied to improve the computational efficiency, our OHPMG implementation has achieved a 28.9x~86.0x speedup in double precision, depending on the system size, the cut-off ranges and the number of GPUs. The implementation can also handle a group of small boxes in one run by combining the small boxes into a large box. This approach greatly improves the GPU computing efficiency when a large number of MD simulations for small boxes are needed for statistical purposes.

Tags: Computational Physics, CUDA, GPU cluster, Materials Science, Molecular dynamics, nVidia, Physics, Tesla C2050

January 2, 2013 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org