high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Computer science » High locality and increased intra-node parallelism for solving finite element models on GPUs by novel element-by-element implementation

High locality and increased intra-node parallelism for solving finite element models on GPUs by novel element-by-element implementation

Imre Kiss, Zsolt Badics, Szabolcs Gyimothy, Jozsef Pavo

Budapest University of Technology and Economics, H-1521 Budapest Hungary

IEEE High Performance Extreme Computing Conference(HPEC ’12), 2012

BibTeX

Download (PDF)

View

Source

2230

views

The utilization of Graphical Processing Units (GPUs) for the element-by-element (EbE) finite element method (FEM) is demonstrated. EbE FEM is a long known technique, by which a conjugate gradient (CG) type iterative solution scheme can be entirely decomposed into computations on the element level, i.e., without assembling the global system matrix. In our implementation, NVIDIA’s parallel computing solution, the Compute Unified Device Architecture (CUDA), is used to perform the required element-wise computations in parallel. Since element matrices need not be stored, the memory requirement can be kept extremely low. It is shown that this low-storage but computation-intensive technique is better suited for GPUs than those requiring the massive manipulation of large data sets. This study of the proposed parallel model illustrates a highly improved locality and minimization of data movement, which could also significantly reduce energy consumption in other heterogeneous HPC architectures.

Tags: Computer science, CUDA, FEM, Finite element method, Heterogeneous systems, nVidia, nVidia GeForce GTX 590

November 18, 2012 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

High locality and increased intra-node parallelism for solving finite element models on GPUs by novel element-by-element implementation

Your response

Recent source codes

Specx: Speculative task-based runtime system

Mutual-Supervised Learning for Sequential-to-Parallel Code Translation

KISim: Kubernetes Intelligent Scheduling Simulator

Hardware Compute Partitioning on NVIDIA GPUs for Composable Systems

Efficient GPU Implementation of Multi-Precision Integer Division

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

exa-AMD: Exascale Accelerated Materials Discovery

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

Most viewed papers (last 30 days)

High locality and increased intra-node parallelism for solving finite element models on GPUs by novel element-by-element implementation

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)