high performance computing on graphics processing units: hgpu.org

Oliver Hennigh, Susheela Narasimhan, Mohammad Amin Nabian, Akshay Subramaniam, Kaustubh Tangsali, Max Rietmann, Jose del Aguila Ferrandis, Wonmin Byeon, Zhiwei Fang, Sanjay Choudhry

View

Download (PDF)

Tags: cfd, CUDA, Fluid dynamics, Linear Algebra, Machine learning, Neural networks, nVidia, nVidia A100, Partial differential equations, PDEs, Physics, Tesla V100, Video

December 20, 2020 by hgpu

Bempp-cl: A fast Python based just-in-time compiling boundary element library

Timo Betcke, Matthew W. Scroggs

View

Download (PDF)

Source codes

Tags: Computer science, Differential equations, Integral equations, OpenCL, Package, Partial differential equations, PDEs, Python

October 11, 2020 by hgpu

Unsupervised Deep Learning of Incompressible Fluid Dynamics

Nils Wandel, Michael Weinmann, Reinhard Klein

View

Download (PDF)

Tags: Deep learning, Differential equations, Fluid dynamics, Fluid simulation, Neural networks, nVidia, nVidia GeForce GTX 2080 Ti, Partial differential equations, PDEs, PyTorch

June 21, 2020 by hgpu

SYCL Code Generation for Multigrid Methods

Stefan Groth, Christian Schmitt, Jürgen Teich, and Frank Hannig

View

Download (PDF)

Source codes

Tags: Code generation, Computer science, Differential equations, OpenCL, Package, Partial differential equations, PDEs

June 16, 2019 by hgpu

Parallel scalable simulations of biological neural networks using TensorFlow: A beginner’s guide

Saptarshi Soham Mohanta, Collins Assisi

View

Download (PDF)

Source codes

Tags: Biology, Computer science, CUDA, Deep learning, Differential equations, Machine learning, Neural networks, nVidia, ODEs, Ordinary differential equations, Package, Partial differential equations, PDEs, Python, TensorFlow, Tutorial

June 12, 2019 by hgpu

Efficient Implementation and Optimization of Geometric Multigrid Operations in the LIFT Framework

Martin Lucke

View

Download (PDF)

Tags: Computer science, Differential equations, nVidia, nVidia GeForce GTX 1080, OpenCL, Partial differential equations, PDEs, performance portability, Thesis

January 27, 2019 by hgpu

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

No More Shading Languages: Compiling C++ to Vulkan Shaders

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Engineering Supercomputing Platforms for Biomolecular Applications

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

* * *

high performance computing on graphics processing units: hgpu.org

Reproducible Study and Performance Analysis of GPU Programming Paradigms: OpenACC vs. CUDA in Key Linear Algebra Computations

FortranX: Harnessing Code Generation, Portability, and Heterogeneity in Fortran

Using AI libraries for Incompressible Computational Fluid Dynamics

ProtoX: A First Look

GPU Offloading in ExaHyPE Through C++ Standard Algorithms

NVIDIA SimNet: an AI-accelerated multi-physics simulation framework

Bempp-cl: A fast Python based just-in-time compiling boundary element library

Unsupervised Deep Learning of Incompressible Fluid Dynamics

SYCL Code Generation for Multigrid Methods

Parallel scalable simulations of biological neural networks using TensorFlow: A beginner’s guide

Efficient Implementation and Optimization of Geometric Multigrid Operations in the LIFT Framework

Recent source codes

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

SYCL Container

Most viewed papers (last 30 days)