Ryan Swann, Muhammad Osama, Karthik Sangaiah, Jalal Mahmud
Peter Maucher, Lennard Kittner, Nico Rath, Gregor Lucka, Lukas Werling, Yussuf Khalil, Thorsten Gröninger, Frank Bellosa
Adrian Perez Dieguez, Min Choi, Mahmut Okyay, Mauro Del Ben, Bryan M. Wong, Khaled Z. Ibrahim
Dolores Miao, Ignacio Laguna, Giorgis Georgakoudis, Konstantinos Parasyris, Cindy Rubio-González
Youssef Faqir-Rhazoui, Carlos García
Sébastien Darche, Michel R. Dagenais
Adel Belkhiri, Michel Dagenais
February 25, 2024 by hgpu
Weile Luo, Ruibo Fan, Zeyu Li, Dayou Du, Qiang Wang, Xiaowen Chu
Tags: Artificial intelligence, Benchmarking, Computer science, CUDA, Deep learning, nVidia, nVidia A100, nVidia GeForce RTX 4090, nVidia H800, Performance, PTX
February 25, 2024 by hgpu
Joshua H. Davis, Pranav Sivaraman, Isaac Minn, Konstantinos Parasyris, Harshitha Menon, Giorgis Georgakoudis, Abhinav Bhatele
Tags: AMD Radeon Instinct MI250X, AMD Radeon Instinct MI50, ATI, Computer science, CUDA, Heterogeneous systems, HIP, MPI, nVidia, nVidia V100, OpenACC, OpenMP, Performance, Performance portability, SYCL
February 18, 2024 by hgpu
Gianmarco Accordi, Davide Gadioli, Emanuele Vitali, Luigi Crisci, Biagio Cosenza, Andrea Beccari, Gianluca Palermo
February 12, 2024 by hgpu
Jolly Chen, Monica Dessole, Ana Lucia Varbanescu
Jiacheng Yang, Christina Giannoula, Jun Wu, Mostafa Elhoushi, James Gleeson, Gennady Pekhimenko
Tags: Cloud, Computer science, CUDA, Matrix multiplication, nVidia, nVidia GeForce RTX 2070, nVidia GeForce RTX 2080 Ti, nVidia GeForce RTX 3090, Package, Performance, PyTorch, Tesla A100