Parallel Implementation of Vortex Element Method on CPUs and GPUs
Bauman Moscow State Technical University, Moscow, Russia
Procedia Computer Science, Volume 66, Pages 73-82, 2015
@article{kuzmina2015parallel,
title={Parallel Implementation of Vortex Element Method on CPUs and GPUs},
author={Kuzmina, Kseniia and Marchevsky, Ilia and Moreva, Victoriya},
journal={Procedia Computer Science},
volume={66},
pages={73–82},
year={2015},
publisher={Elsevier}
}
The implementations of 2D vortex element method adapted to different types of parallel computers are considered. The developed MPI-implementation provides close to linear acceleration for small number of computational cores and approximately 40-times acceleration for 80-cores cluster when solving model problem. OpenMP-based modification allows to obtain 5% additional acceleration due to shared memory usage. Approximate fast multipole method usage reduces time of computations significantly: 11 times for the testmodel problem in sequential mode and 3.5 times in parallel mode for 16-cores cluster. The most efficient implementation of vortex element method is developed for GPUs using NVidia CUDA technology. Time of the model problem solving using single GeForce GTX 970 or Tesla C2070 accelerator is comparable with time of its solving on cluster when involving 30-40 cores of Intel Xeon E5450 CPUs.
December 6, 2015 by hgpu