high performance computing on graphics processing units: hgpu.org

hgpu.org » AMD Radeon Instinct MI250X

Exploring SYCL for batched kernels with memory allocations

Aymeric Millan, Thomas Padioleau, Julien Bigot

View

Download (PDF)

Source codes

Tags: AMD Radeon Instinct MI250X, ATI, Computer science, CUDA, FFT, Neural networks, nVidia, nVidia A100, Package, performance portability, SYCL

May 25, 2025 by hgpu

The Shamrock code: I- Smoothed Particle Hydrodynamics on GPUs

Timothée David--Cléris, Guillaume Laibe, Yona Lapeyre

View

Download (PDF)

Source codes

Tags: AMD, AMD Radeon Instinct MI250X, Astrophysics, CUDA, MPI, nVidia, nVidia A100, OpenMP, Package, Physics, PTX, ROCm, SYCL

March 23, 2025 by hgpu

Leveraging LLVM OpenMP GPU Offload Optimizations for Kokkos Applications

Rahulkumar Gayatri, Shilei Tian, Stephen Olivier, Johannes Doerfert, Eric Wright

View

Download (PDF)

Source codes

Tags: AMD Radeon Instinct MI250X, ATI, Computer science, CUDA, HIP, MPI, nVidia, nVidia A100, OpenMP, Package, performance portability

February 16, 2025 by hgpu

Asynchronous-Many-Task Systems: Challenges and Opportunities – Scaling an AMR Astrophysics Code on Exascale machines using Kokkos and HPX

Gregor Daiß, Patrick Diehl, Jiakun Yan, John K. Holmen, Rahulkumar Gayatri, Christoph Junghans, Alexander Straub, Jeff R. Hammond, Dominic Marcello, Miwako Tsuji, Dirk Pflüger, Hartmut Kaiser

View

Download (PDF)

Source codes

Tags: AMD Radeon Instinct MI100, AMD Radeon Instinct MI250X, Astrophysics, ATI, Computer science, CUDA, Heterogeneous systems, HIP, HPC, nVidia, nVidia A100, Package, performance portability, Physics

December 29, 2024 by hgpu

Scaling SU(2) to 1000 GPUs using HiRep

Sofie Martins, Erik Kjellgren, Emiliano Molinaro, Claudio Pica, Antonio Rago

View

Download (PDF)

Source codes

Tags: AMD Radeon Instinct MI250X, ATI, CUDA, HEP, High Energy Physics - Lattice, HIP, Monte Carlo simulation, nVidia, nVidia H100, Package, Physics

December 1, 2024 by hgpu

Performance portability via C++ PSTL, SYCL, OpenMP, and HIP: the Gaia AVU-GSR case study

Giulio Malenza, Valentina Cesare, Marco Edoardo Santimaria, Robert Birke, Alberto Vecchiato, Ugo Becciani, Marco Aldinucci

View

Download (PDF)

Source codes

Tags: AMD Radeon Instinct MI250X, Astrophysics, ATI, Computer science, CUDA, HIP, HPC, nVidia, nVidia A100, nVidia H100, nVidia V100, OpenMP, Package, Performance, performance portability, SYCL, Tesla T4

November 24, 2024 by hgpu

Understanding Data Movement in AMD Multi-GPU Systems with Infinity Fabric

Gabin Schieffer, Ruimin Shi, Stefano Markidis, Andreas Herten, Jennifer Faj, Ivy Peng

View

Download (PDF)

Tags: AMD, AMD Radeon Instinct MI250X, ATI, Computer science, HIP, Machine learning, Memory, Performance

October 6, 2024 by hgpu

OpenACC offloading of the MFC compressible multiphase flow solver on AMD and NVIDIA GPUs

Benjamin Wilfong, Anand Radhakrishnan, Henry A. Le Berre, Steve Abbott, Reuben D. Budiardja, Spencer H. Bryngelson

View

Download (PDF)

Source codes

Tags: AMD Radeon Instinct MI250X, ATI, cfd, Fluid dynamics, MPI, nVidia, OpenACC, Package, Tesla V100

September 29, 2024 by hgpu

Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects

Daniele De Sensi, Lorenzo Pichetti, Flavio Vella, Tiziano De Matteis, Zebin Ren, Luigi Fusco, Matteo Turisini, Daniele Cesarini, Kurt Lust, Animesh Trivedi, Duncan Roweth, Filippo Spiga, Salvatore Di Girolamo, Torsten Hoefler

View

Download (PDF)

Tags: AMD Radeon Instinct MI250X, ATI, Benchmarking, Computer science, CUDA, HPC, MPI, nVidia, nVidia A100, nVidia H100, Performance