Assembly of finite element methods on graphics processors

Cris Cecka, Adrian J. Lew, E. Darve
Institute for Computational and Mathematical Engineering, Stanford University, CA, U.S.A.
International Journal for Numerical Methods in Engineering, Volume 85, Issue 5, pages 640-669, 2011


   title={Assembly of finite element methods on graphics processors},

   author={Cecka, C. and Lew, A.J. and Darve, E.},

   journal={International Journal for Numerical Methods in Engineering},





   publisher={Wiley Online Library}


Download Download (PDF)   View View   Source Source   



Recently, graphics processing units (GPUs) have had great success in accelerating many numerical computations. We present their application to computations on unstructured meshes such as those in finite element methods. Multiple approaches in assembling and solving sparse linear systems with NVIDIA GPUs and the Compute Unified Device Architecture (CUDA) are created and analyzed. Multiple strategies for efficient use of global, shared, and local memory, methods to achieve memory coalescing, and optimal choice of parameters are introduced. We find that with appropriate preprocessing and arrangement of support data, the GPU coprocessor using single-precision arithmetic achieves speedups of 30 or more in comparison to a well optimized double-precision single core implementation. We also find that the optimal assembly strategy depends on the order of polynomials used in the finite element discretization.
No votes yet.
Please wait...

You must be logged in to post a comment.

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: