The TheLMA project: Multi-GPU implementation of the lattice Boltzmann method
EDF R&D, Département EnerBAT, Moret-sur-Loing Cedex, France
International Journal of High Performance Computing Applications, August 2011, vol. 25, no. 3, 295-303
@article{obrecht2011thelma,
title={The TheLMA project: Multi-GPU implementation of the lattice Boltzmann method},
author={Obrecht, C. and Kuznik, F. and Tourancheau, B. and Roux, J.J.},
journal={International Journal of High Performance Computing Applications},
year={2011},
volume={25},
number={3},
pages={295--303},
publisher={SAGE Publications}
}
In this paper, we describe the implementation of a multi-GPU (graphics processing unit) fluid flow solver based on the lattice Boltzmann method (LBM). The LBM is a novel approach in computational fluid dynamics, with numerous interesting features from a computational, numerical, and physical standpoint. Our program is based on CUDA and uses POSIX threads to manage multiple computation devices. Using recently released hardware, our solver can run on eight GPUs in parallel, which allows us to perform simulations at a rather large scale. Performance and scalability are excellent, with a speedup over sequential implementations of at least two orders of magnitude. In addition, we discuss tiling and communication issues for present and forthcoming implementations.
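The abstract states that the solver uses POSIX threads to manage multiple computation devices. The following C sketch illustrates only that general pattern and is not taken from the TheLMA code: one host thread is created per device, and each thread is handed a contiguous slab of the lattice. The slab decomposition along x, the `slab_t` type, and the functions `device_worker` and `run_on_devices` are all assumptions made for illustration. No CUDA call is made here; a comment marks where a real host thread would select its GPU.

```c
#include <assert.h>
#include <pthread.h>

#define MAX_DEVICES 8 /* the abstract mentions up to eight GPUs */

/* Hypothetical work description: a slab of lattice planes along x. */
typedef struct {
    int device_id;   /* index of the GPU this thread would drive */
    int x_begin;     /* first plane owned by this device (inclusive) */
    int x_end;       /* one past the last owned plane (exclusive) */
    int ny, nz;      /* extent of the lattice in the other directions */
    long nodes_done; /* written by the worker: number of nodes handled */
} slab_t;

/* One host thread per device. A real CUDA program would call
 * cudaSetDevice(s->device_id) here and then launch its kernels;
 * this stand-in only counts the nodes in the slab. */
static void *device_worker(void *arg)
{
    slab_t *s = (slab_t *)arg;
    assert(s != NULL);
    s->nodes_done = (long)(s->x_end - s->x_begin) * s->ny * s->nz;
    return NULL;
}

/* Split nx planes as evenly as possible over n_devices threads.
 * Returns the total number of nodes handled, or -1 on error. */
long run_on_devices(int n_devices, int nx, int ny, int nz)
{
    pthread_t threads[MAX_DEVICES];
    slab_t slabs[MAX_DEVICES];

    if (n_devices < 1 || n_devices > MAX_DEVICES || nx < n_devices)
        return -1;

    int base = nx / n_devices;  /* planes every device gets */
    int extra = nx % n_devices; /* first `extra` devices get one more */
    int x = 0;
    int started = 0;

    for (int i = 0; i < n_devices; ++i) {
        int width = base + (i < extra ? 1 : 0);
        slabs[i] = (slab_t){ .device_id = i, .x_begin = x,
                             .x_end = x + width, .ny = ny, .nz = nz,
                             .nodes_done = 0 };
        x += width;
        if (pthread_create(&threads[i], NULL, device_worker, &slabs[i]) != 0)
            break;
        ++started;
    }

    /* Join only the threads that were actually created. */
    long total = 0;
    for (int i = 0; i < started; ++i) {
        pthread_join(threads[i], NULL);
        total += slabs[i].nodes_done;
    }
    return started == n_devices ? total : -1;
}
```

Calling `run_on_devices(8, 256, 256, 256)` splits a 256³ lattice into eight slabs of 32 planes each. In an actual multi-GPU solver the threads would also have to exchange boundary data between neighbouring slabs at every time step, which is the kind of communication issue the abstract refers to; that exchange is omitted here.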
September 19, 2011 by hgpu