17230

Accelerating solutions of PDEs with GPU-based swept time-space decomposition

Daniel J Magee, Kyle E Niemeyer
School of Mechanical, Industrial, and Manufacturing Engineering, Oregon State University, Corvallis, OR
arXiv:1705.03162 [physics.comp-ph], (9 May 2017)

@article{magee2017accelerating,

   title={Accelerating solutions of PDEs with GPU-based swept time-space decomposition},

   author={Magee, Daniel J and Niemeyer, Kyle E},

   year={2017},

   month={may},

   archivePrefix={"arXiv"},

   primaryClass={physics.comp-ph}

}

The expedient design of precision components in aerospace and other high-tech industries requires simulations of physical phenomena often described by partial differential equations (PDEs) without exact solutions. Modern design problems require simulations with a level of resolution difficult to achieve in reasonable amounts of time—even in effectively parallelized solvers. Though the scale of the problem relative to available computing power is the greatest impediment to accelerating these applications, significant performance gains can be achieved through careful attention to the details of memory communication and access. The swept time-space decomposition rule reduces communication between subdomains by exhausting the domain of influence before communicating boundary values. Here we present a GPU implementation of the swept rule, which modifies the algorithm for improved performance on this processing architecture by prioritizing use of private (shared) memory, avoiding interblock communication, and overwriting unnecessary values. It shows significant improvement in the execution time of one-dimensional, finite-difference PDE solvers for scalar equations, producing speedups of 2-9x for a range of problem sizes, respectively, compared with simple GPU versions and 7-300x compared with parallel CPU versions. However, for a more sophisticated one-dimensional system of equations discretized with a second-order finite-volume scheme, the swept rule performs 1.2-1.9x worse than a basic implementation for all problem sizes.
Rating: 2.7/5. From 3 votes.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: