Brandon Alexander Burtchell, Martin Burtscher
Mert Hidayetoglu, Simon Garcia de Gonzalo, Elliott Slaughter, Pinku Surana, Wen-mei Hwu, William Gropp, Alex Aiken
Ronald M. Caplan, Miko M. Stulajter, Jon A. Linker, Jeff Larkin, Henry A. Gabb, Shiquan Su, Ivan Rodriguez, Zachary Tschirhart, Nicholas Malaya
Tags: Computer science, Fortran, Intel, Intel Data Center GPU Max 1550, Intel Ponte Vecchio Max 1100, nVidia, nVidia A100, nVidia GH200, nVidia H100, OpenACC, OpenMP, Package
Yi Ju, Mingshuai Li, Adalberto Perez, Laura Bellentani, Niclas Jansson, Stefano Markidis, Philipp Schlatter, Erwin Laure
Seonho Lee, Amar Phanishayee, Divya Mahajan
Tags: Computer science, CUDA, Deep learning, nVidia, nVidia A100, nVidia H100, nVidia P100, nVidia V100, Performance, PyTorch, Tesla T4
Milo Lurati, Stijn Heldens, Alessio Sclocco, Ben van Werkhoven
Tags: AMD Radeon Instinct MI250X, AMD Radeon Pro W6600, ATI, Computer science, CUDA, HIP, nVidia, nVidia A100, nVidia RTX A4000, Package, Performance, Python
Pasqua D'Ambra, Fabio Durastante, Salvatore Filippone
Arun Thangamani, Vincent Loechner, Stéphane Genaud
Avinash Maurya, Jie Ye, M. Mustafa Rafique, Franck Cappello, Bogdan Nicolae
Muyan Hu, Ashwin Venkatram, Shreyashri Biswas, Balamurugan Marimuthu, Bohan Hou, Gabriele Oliaro, Haojie Wang, Liyan Zheng, Xupeng Miao, Jidong Zhai
Johannes Pekkilä, Oskar Lappi, Fredrik Robertsén, Maarit J. Korpi-Lagg
Tags: AMD Radeon Instinct MI100, AMD Radeon Instinct MI250X, ATI, Computer science, CUDA, Energy-efficient computing, HIP, nVidia, nVidia A100, nVidia V100, Package, Performance, PyTorch, Stencil computation