high performance computing on graphics processing units: hgpu.org

hgpu.org » Finite difference

Seamless GPU acceleration for C++ based physics with the Metal Shading Language on Apple’s M series unified chips

Lars Gebraad, Andreas Fichtner

View

Download (PDF)

Source codes

Tags: Benchmarking, Computer science, Finite difference, nVidia, Package, Physics, Wave equation

June 12, 2022 by hgpu

Scalable communication for high-order stencil computations using CUDA-aware MPI

Johannes Pekkilä, Miikka S. Väisälä, Maarit J. Käpylä, Matthias Rheinhardt, Oskar Lappi

View

Download (PDF)

Tags: Computer science, CUDA, Finite difference, Magnetohydrodynamics, MPI, nVidia, Tesla V100

March 7, 2021 by hgpu

CPU/GPU Code Acceleration on Heterogeneous Systems and Code Verification for CFD Applications

Weicheng Xue

View

Download (PDF)

Tags: cfd, CUDA, Finite difference, Fluid dynamics, Heterogeneous systems, MPI, nVidia, OpenACC, Tesla C2075, Tesla P100, Tesla V100, Thesis

January 31, 2021 by hgpu

A mechanism for balancing accuracy and scope in cross-machine black-box GPU performance modeling

James D. Stevens

View

Download (PDF)

Tags: AMD Radeon R9 Fury, ATI, Code generation, Computer science, Finite difference, Heterogeneous systems, Matrix multiplication, nVidia, nVidia GeForce GTX Titan X, OpenCL, Performance, Tesla C2070, Tesla K40

April 28, 2019 by hgpu

cuSten – CUDA Finite Difference and Stencil Library

Andrew Gloster, Lennon O'Naraigh

View

Download (PDF)

Source codes

Tags: Computational Physics, Computer science, CUDA, Finite difference, MPI, nVidia, nVidia GeForce GTX Titan X, Package, PDEs

March 3, 2019 by hgpu

Performance Portability Challenges for Fortran Applications

Abigail Hsu, David Neill Asanza, Joseph A. Schoonover, Zach Jibben, Neil N. Carlson, Robert Robey

View

Download (PDF)

Source codes

Tags: Computer science, CUDA, Finite difference, Fortran, nVidia, OpenMP, Package, performance portability, Tesla P100, Tesla V100

December 2, 2018 by hgpu

Energy Consumption of Algorithms for Solving the Compressible Navier-Stokes Equations on CPU’s, GPU’s and KNL’s

Satya P. Jammy, Christian T. Jacobs, David J. Lusher, Neil D. Sandham

View

Download (PDF)

Tags: CUDA, Finite difference, Fluid dynamics, Intel Xeon Phi, Navier-Stokes equations, NSEs, nVidia, Tesla K40

July 7, 2018 by hgpu

Energy efficiency of finite difference algorithms on multicore CPUs, GPUs, and Intel Xeon Phi processors

Satya P. Jammy, Christian T. Jacobs, David J. Lusher, Neil D. Sandham

View

Download (PDF)

Tags: Algorithms, Computer science, CUDA, Energy-efficient computing, Finite difference, Intel Xeon Phi, nVidia, Tesla K40

October 3, 2017 by hgpu

Non-Hydrostatic Pressure Shallow Flows: GPU Implementation Using Finite-Volume and Finite-Difference Scheme

C. Escalante, T. Morales de Luna, M.J. Castro

View

Download (PDF)

Tags: Algorithms, CUDA, Finite difference, Fluid dynamics, Mathematics, Numerical Analysis, nVidia

June 17, 2017 by hgpu

Speeding up a few orders of magnitude the Jacobi method: high order Chebyshev-Jacobi over GPUs

J.E. Adsuara, I. Cordero-Carrion, P. Cerda-Duran, M.A. Aloy

View

Download (PDF)

Tags: CUDA, Differential equations, Finite difference, Mathematics, Numerical Analysis, nVidia, nVidia GeForce GTX Titan X, Partial differential equations, PDEs, Tesla K40

May 2, 2017 by hgpu

OpenCL-Based FPGA Accelerator for 3D FDTD with Periodic and Absorbing Boundary Conditions

Hasitha Muthumala Waidyasooriya, Tsukasa Endo, Masanori Hariyama, Yasuo Ohtera

View

Download (PDF)

Tags: Computer science, Differential equations, FDTD, Finite difference, Finite-difference time-domain, FPGA, OpenCL, Partial differential equations, PDEs

April 26, 2017 by hgpu

Parallel Level set algorithm with MPI and accelerated on GPU

Zhenlin Wang

View

Download (PDF)

Source codes

Tags: Computational Physics, CUDA, Finite difference, MPI, nVidia, Package, PDEs, Physics

December 17, 2016 by hgpu

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

No More Shading Languages: Compiling C++ to Vulkan Shaders

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Engineering Supercomputing Platforms for Biomolecular Applications

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

* * *

high performance computing on graphics processing units: hgpu.org

Seamless GPU acceleration for C++ based physics with the Metal Shading Language on Apple’s M series unified chips

Scalable communication for high-order stencil computations using CUDA-aware MPI

CPU/GPU Code Acceleration on Heterogeneous Systems and Code Verification for CFD Applications

A mechanism for balancing accuracy and scope in cross-machine black-box GPU performance modeling

cuSten – CUDA Finite Difference and Stencil Library

Performance Portability Challenges for Fortran Applications

Energy Consumption of Algorithms for Solving the Compressible Navier-Stokes Equations on CPU’s, GPU’s and KNL’s

Energy efficiency of finite difference algorithms on multicore CPUs, GPUs, and Intel Xeon Phi processors

Non-Hydrostatic Pressure Shallow Flows: GPU Implementation Using Finite-Volume and Finite-Difference Scheme

Speeding up a few orders of magnitude the Jacobi method: high order Chebyshev-Jacobi over GPUs

OpenCL-Based FPGA Accelerator for 3D FDTD with Periodic and Absorbing Boundary Conditions

Parallel Level set algorithm with MPI and accelerated on GPU

Recent source codes

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

SYCL Container

Most viewed papers (last 30 days)