high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » Assembly of finite element methods on graphics processors

Assembly of finite element methods on graphics processors

Cris Cecka, Adrian J. Lew, E. Darve

Institute for Computational and Mathematical Engineering, Stanford University, CA, U.S.A.

International Journal for Numerical Methods in Engineering, Volume 85, Issue 5, pages 640-669, 2011

DOI:10.1002/nme.2989

@article{cecka2011assembly,

title={Assembly of finite element methods on graphics processors},

author={Cecka, C. and Lew, A.J. and Darve, E.},

journal={International Journal for Numerical Methods in Engineering},

volume={85},

number={5},

pages={640–669},

year={2011},

publisher={Wiley Online Library}

}

Download (PDF)

View

Source

1265

views

Recently, graphics processing units (GPUs) have had great success in accelerating many numerical computations. We present their application to computations on unstructured meshes such as those in finite element methods. Multiple approaches in assembling and solving sparse linear systems with NVIDIA GPUs and the Compute Unified Device Architecture (CUDA) are created and analyzed. Multiple strategies for efficient use of global, shared, and local memory, methods to achieve memory coalescing, and optimal choice of parameters are introduced. We find that with appropriate preprocessing and arrangement of support data, the GPU coprocessor using single-precision arithmetic achieves speedups of 30 or more in comparison to a well optimized double-precision single core implementation. We also find that the optimal assembly strategy depends on the order of polynomials used in the finite element discretization.

Tags: Computer science, CUDA, FEM, Finite element method, nVidia, nVidia GeForce 8800 GTX, Tesla C1060

November 27, 2011 by hgpu

No votes yet.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

high performance computing on graphics processing units: hgpu.org

Assembly of finite element methods on graphics processors

Recent source codes

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

GPU plugin for PySCF

QArray

Celerity: High-level C++ for Accelerator Clusters

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Most viewed papers (last 30 days)

Assembly of finite element methods on graphics processors

Share this:

Recent source codes

Most viewed papers (last 30 days)