high performance computing on graphics processing units: hgpu.org

Applications

Fluid dynamics

TRUST: the HPC open-source CFD platform – from CPU to GPU

Elie Saikali, Adrien Bruneton, Pierre Ledac, Remi Bourgeois

Tags: AMD Radeon Instinct MI250A, AMD Radeon Instinct MI300A, ATI, cfd, CUDA, Fluid dynamics, HIP, MPI, Numerical simulation, nVidia, nVidia A100, OpenMP, Package

September 28, 2025 by hgpu

GPU-acceleration of the Discontinuous Galerkin Shallow Water Equations Solver (DG-SWEM) using CUDA and OpenACC

Chayanon Wichitrnithed, Eirik Valseth, Clint Dawson

Tags: cfd, Computational Physics, CUDA, Data parallelism, Fluid dynamics, Fortran, MPI, nVidia, nVidia GH200, OpenACC, Package, Physics

September 7, 2025 by hgpu

miniLB: A Performance Portability Study of Lattice-Boltzmann Simulations

Luigi Crisci, Biagio Cosenza, Giorgio Amati, Matteo Turisini

Tags: AMD Radeon Instinct MI100, ATI, cfd, Fluid dynamics, nVidia, Package, SYCL, Tesla V100

September 29, 2024 by hgpu

OpenACC offloading of the MFC compressible multiphase flow solver on AMD and NVIDIA GPUs

Benjamin Wilfong, Anand Radhakrishnan, Henry A. Le Berre, Steve Abbott, Reuben D. Budiardja, Spencer H. Bryngelson

Tags: AMD Radeon Instinct MI250X, ATI, cfd, Fluid dynamics, MPI, nVidia, OpenACC, Package, Tesla V100

September 29, 2024 by hgpu

In-Situ Techniques on GPU-Accelerated Data-Intensive Applications

Yi Ju, Mingshuai Li, Adalberto Perez, Laura Bellentani, Niclas Jansson, Stefano Markidis, Philipp Schlatter, Erwin Laure

Tags: Computer science, CUDA, Fluid dynamics, HPC, Molecular dynamics, nVidia, nVidia A100, PC cluster, Performance

August 14, 2024 by hgpu

Direct Numerical Simulation of Turbulence on Heterogeneous Computer Systems: Architectures, Algorithms, and Applications

Martin Karp

Tags: AMD Radeon Instinct MI250X, ATI, cfd, Fluid dynamics, FPGA, Heterogeneous systems, Numerical simulation, nVidia, nVidia A100, nVidia P100, nVidia V100, OpenCL, Spectral elements, Thesis

May 12, 2024 by hgpu

Porting HPC Applications to AMD Instinct MI300A Using Unified Memory and OpenMP

Suyash Tandon, Leopold Grinberg, Gheorghe-Teodor Bercea, Carlo Bertolli, Mark Olesen, Simone Bnà, Nicholas Malaya

Tags: AMD Radeon Instinct MI210, AMD Radeon Instinct MI300A, ATI, cfd, Computer science, Fluid dynamics, HPC, nVidia, nVidia A100, nVidia H100, OpenMP

May 5, 2024 by hgpu

OpenMP offload at the Exascale using Intel GPU Max 1550: evaluation of STREAmS compressible solver

Francesco Salvadore, Giacomo Rossi, Srikanth Sathyanarayana, Matteo Bernardini

Tags: AMD Radeon Instinct MI250X, ATI, Benchmarking, cfd, Compression, Fluid dynamics, Intel, Intel Data Center GPU Max 1550, nVidia, nVidia A100, OpenMP

April 14, 2024 by hgpu

Speed, power and cost implications for GPU acceleration of Computational Fluid Dynamics on HPC systems

Zachary Cooper-Baldock, Brenda Vara Almirall, Kiao Inthavong

Tags: cfd, Fluid dynamics, HPC, MPI, nVidia, nVidia A100, nVidia V100, Performance

April 7, 2024 by hgpu

Using AI libraries for Incompressible Computational Fluid Dynamics

Boyang Chen, Claire E. Heaney, Christopher C. Pain

Tags: AI, Artificial intelligence, cfd, CUDA, Fluid dynamics, Neural networks, nVidia, Partial differential equations, PDEs, Tesla T4

March 3, 2024 by hgpu

High-order thread-safe lattice Boltzmann model for HPC turbulent flow simulations

Andrea Montessori, Michele La Rocca, Giorgio Amati, Marco Lauricella, Adriano Tiribocchi, Sauro Succi

Tags: cfd, CUDA, Fluid dynamics, HPC, Lattice Boltzmann model, nVidia, nVidia A100, nVidia GeForce RTX 3090, Package

February 4, 2024 by hgpu

Optimization of Ported CFD Kernels on Intel Data Center GPU Max 1550 using oneAPI ESIMD

Mohammad Zubair, Aaron Walden, Gabriel Nastac, Eric Nielsen, Christoph Bauinger, Xiao Zhu

Tags: cfd, CUDA, Fluid dynamics, Intel, Intel Data Center GPU Max 1550, nVidia, nVidia A100, oneAPI, Performance, SYCL

December 31, 2023 by hgpu
