Marcin Spoczynski, Marcela S. Melara
Andreas Herten, Olga Pearce, Filipe S. M. Guimarães
Tags: Benchmarking, Computer science, CUDA, Fortran, HIP, HPC, MPI, OpenACC, OpenCL, OpenMP, Package, Performance, ROCm, SYCL
September 14, 2025 by
hgpuEvelyne Ringoot, Rabab Alomairy, Valentin Churavy, Alan Edelman
Tags: AMD Radeon Instinct MI250, Apple M1 Pro, ATI, Computer science, HIP, Intel, Intel Ponte Vecchio Max 1100, Kokkos, Linear Algebra, Machine learning, nVidia, nVidia A100, nVidia GeForce RTX 4060, nVidia H100, OpenCL, SYCL
Hari Abram, Nikela Papadopoulou, Miquel Pericas
Aymeric Millan, Thomas Padioleau, Julien Bigot
Tags: AMD Radeon Instinct MI250X, ATI, Computer science, CUDA, FFT, Neural networks, nVidia, nVidia A100, Package, performance portability, SYCL
Timothée David--Cléris, Guillaume Laibe, Yona Lapeyre
Tags: AMD, AMD Radeon Instinct MI250X, Astrophysics, CUDA, MPI, nVidia, nVidia A100, OpenMP, Package, Physics, PTX, ROCm, SYCL
Fabian Knorr, Philip Salzmann, Peter Thoman, Thomas Fahringer
Dewei Wang, Wei Zhu, Liyang Ling, Ettore Tiotto, Quintin Wang, Whitney Tsang, Julian Opperman, Jacky Deng
Nozal Raúl, Jose Luis Bosque
Tags: Computer science, CUDA, Heterogeneous systems, Hybrid computing, LLVM, load balancing, nVidia, nVidia GeForce GT 1030, oneAPI, OpenCL, performance portability, SYCL
Cristian Campos, Rafael Asenjo, Angeles Navarro