OpenACC-based GPU Acceleration of a 3-D Unstructured Discontinuous Galerkin Method
Department of Mechanical and Aerospace Engineering, North Carolina State University, Raleigh, NC 27695-7910, United States
AIAA Aerospace Sciences Meeting, 2014
@article{xia2014openacc,
title={OpenACC-based GPU Acceleration of a 3-D Unstructured Discontinuous Galerkin Method},
author={Xia, Yidong and Luo, Hong and Luo, Lixiang and Edwards, Jack and Lou, Jialin and Mueller, Frank},
year={2014}
}
A GPU-accelerated discontinuous Galerkin (DG) method is presented for the solution of compressible flows on 3-D unstructured grids. The present work has employed two of the most attractive features in a new programming standard of parallel computing – OpenACC: 1) multi-platform/compiler support and 2) descriptive directive interface to upgrade a legacy CFD solver with the capacity of GPU computing, without significant extra cost in recoding, resulting in a highly portable and extensible GPU-accelerated code. In addition, a face renumbering/grouping scheme is proposed to overcome the "race condition" in face-based flux calculations that occurs on GPU vectorization. Performance of the developed double-precision solver is assessed for both simple and complex geometries. Speedup factors up to but not limited to 24x and 1.6x were achieved by comparing the measured computing time of the OpenACC program running on an NVIDIA Tesla K20c GPU to that of the equivalent MPI program running on one single core and full sixteen cores of an AMD Opteron-6128 CPU respectively, indicating a great potential to port more features of the underlying DG solver into the OpenACC framework.
February 5, 2014 by hgpu