Gabin Schieffer, Ruimin Shi, Stefano Markidis, Andreas Herten, Jennifer Faj, Ivy Peng
Benjamin Wilfong, Anand Radhakrishnan, Henry A. Le Berre, Steve Abbott, Reuben D. Budiardja, Spencer H. Bryngelson
September 29, 2024 by
hgpuDaniele De Sensi, Lorenzo Pichetti, Flavio Vella, Tiziano De Matteis, Zebin Ren, Luigi Fusco, Matteo Turisini, Daniele Cesarini, Kurt Lust, Animesh Trivedi, Duncan Roweth, Filippo Spiga, Salvatore Di Girolamo, Torsten Hoefler
Tags: AMD Radeon Instinct MI250X, ATI, Benchmarking, Computer science, CUDA, HPC, MPI, nVidia, nVidia A100, nVidia H100, Performance
September 1, 2024 by
hgpuAaron Jarmusch, Felipe Cabarcas, Swaroop Pophale, Andrew Kallai, Johannes Doerfert, Luke Peyralans, Seyong Lee, Joel Denny, Sunita Chandrasekaran
Tags: AMD Radeon Instinct MI210, AMD Radeon Instinct MI250X, ATI, Compilers, Computer science, Fortran, Heterogeneous systems, HPC, nVidia, nVidia H100, OpenMP
Mert Hidayetoglu, Simon Garcia de Gonzalo, Elliott Slaughter, Pinku Surana, Wen-mei Hwu, William Gropp, Alex Aiken
Milo Lurati, Stijn Heldens, Alessio Sclocco, Ben van Werkhoven
Tags: AMD Radeon Instinct MI250X, AMD Radeon Pro W6600, ATI, Computer science, CUDA, HIP, nVidia, nVidia A100, nVidia RTX A4000, Package, Performance, Python
Johannes Pekkilä, Oskar Lappi, Fredrik Robertsén, Maarit J. Korpi-Lagg
Tags: AMD Radeon Instinct MI100, AMD Radeon Instinct MI250X, ATI, Computer science, CUDA, Energy-efficient computing, HIP, nVidia, nVidia A100, nVidia V100, Package, Performance, PyTorch, Stencil computation
Andrey Alekseenko, Szilárd Páll, Erik Lindahl
Francesco Salvadore, Giacomo Rossi, Srikanth Sathyanarayana, Matteo Bernardini
Tags: AMD Radeon Instinct MI250X, ATI, Benchmarking, cfd, Compression, Fluid dynamics, Intel, Intel Data Center GPU Max 1550, nVidia, nVidia A100, OpenMP
John Tramm, Paul Romano, Patrick Shriwise, Amanda Lund, Johannes Doerfert, Patrick Steinbrecher, Andrew Siegel, Gavin Ridley
Tags: AMD Radeon Instinct MI250X, ATI, Computer science, CUDA, Intel, Intel Data Center GPU Max 1550, Intel Ponte Vecchio Max 1100, nVidia, nVidia A100, OpenMP, Package, performance portability
Xinyi Li, Ang Li, Bo Fang, Katarzyna Swirydowicz, Ignacio Laguna, Ganesh Gopalakrishnan
Tags: AMD Radeon Instinct MI100, AMD Radeon Instinct MI250X, ATI, Computer science, Hardware Architecture, HPC, Matrix multiplication, nVidia, nVidia A100, nVidia H100, nVidia V100, PTX