GPU-Accelerated parallel FDTD on Distributed Heterogeneous Platform
Research and Development Department, Shanghai Supercomputer Center, Shanghai 201203, China
International Journal of Antennas and Propagation, 2014
@article{jiang2014gpu,
title={GPU-Accelerated parallel FDTD on Distributed Heterogeneous Platform},
author={Jiang, Ronglin and Jiang, Shugang and Zhang, Yu and Xu, Ying and Xu, Lei and Zhang, Dandan},
year={2014}
}
This paper introduces a (Finite-Difference Time-Domain) FDTD code written in Fortran and CUDA for realistic electromagnetic calculations with parallelization methods of Message Passing Interface (MPI) and Open Multi-Processing (OpenMP). Since both Central Processing Unit (CPU) and Graphics Processing Unit (GPU) resources are utilized, a faster execution speed can be reached compared to a traditional pure GPU code. In our experiments, 64 NVIDIA TESLA K20m GPUs and 64 INTEL XEON E5-2670 CPUs are used to carry out the pure CPU, pure GPU and CPU + GPU tests. Relative to the pure CPU calculations for the same problems, the speedup ratio achieved by CPU + GPU calculations is around 14. Compared to the pure GPU calculations for the same problems, the CPU + GPU calculations have 7.6 % – 13.2 % performance improvement. Because of the small memory size of GPUs, the FDTD problem size is usually very small. However, this code can enlarge the maximum problem size by 25 % without reducing the performance of traditional pure GPU code. Finally, using this code, a microstrip antenna array with 16 x 18 elements is calculated and the radiation patterns are compared with the ones of MoM. Results show that there is a well agreement between them.
January 12, 2014 by hgpu