Peter Thoman, Philip Salzmann
Zachary Cooper-Baldock, Brenda Vara Almirall, Kiao Inthavong
Xinyi Li, Ang Li, Bo Fang, Katarzyna Swirydowicz, Ignacio Laguna, Ganesh Gopalakrishnan
Tags: AMD Radeon Instinct MI100, AMD Radeon Instinct MI250X, ATI, Computer science, Hardware Architecture, HPC, Matrix multiplication, nVidia, nVidia A100, nVidia H100, nVidia V100, PTX
Manuel Costanzo, Enzo Rucci, Carlos García-Sanchez, Marcelo Naiouf, Manuel Prieto-Matías
Tags: AMD Radeon RX 6700 XT, AMD Radeon RX Vega 6, ATI, Bioinformatics, Biology, Computational biology, CUDA, Heterogeneous systems, Intel Arc A770, Intel UHD 630, nVidia, nVidia GeForce GTX 1080, nVidia GeForce GTX 980, nVidia GeForce RTX 2070, nVidia GeForce RTX 3090, nVidia V100, oneAPI, Package, Sequence alignment, SYCL
February 25, 2024 by
hgpuDimitrios Danopoulos, Georgios Zervakis, Dimitrios Soudris, Jörg Henkel
February 18, 2024 by
hgpuJoshua H. Davis, Pranav Sivaraman, Isaac Minn, Konstantinos Parasyris, Harshitha Menon, Giorgis Georgakoudis, Abhinav Bhatele
Tags: AMD Radeon Instinct MI250X, AMD Radeon Instinct Mi50, ATI, Computer science, CUDA, Heterogeneous systems, HIP, MPI, nVidia, nVidia V100, OpenACC, OpenMP, Performance, performance portability, SYCL
February 18, 2024 by
hgpuKa Hei Martin Kwok, Matti Kortelainen, Giuseppe Cerati, Alexei Strelchenko, Oliver Gutsche, Allison Reinsvold Hall, Steve Lantz, Michael Reid, Daniel Riley, Sophie Berkman, Seyong Lee, Hammad Ather, Boyana Norris, Cong Wang
Tags: AMD Radeon Instinct MI100, ATI, HEP, Intel, Intel Arc A770, nVidia, nVidia A100, nVidia V100, OpenMP, performance portability, Physics, SYCL
Qian Gong, Jieyang Chen, Ben Whitney, Xin Liang, Viktor Reshniak, Tania Banerjee, Jaemoon Lee, Anand Rangarajan, Lipeng Wan, Nicolas Vidal, Qing Liu, Ana Gainaru, Norbert Podhorszki, Richard Archibald, Sanjay Ranka, Scott Klasky
Tags: Compression, Computer science, CUDA, HIP, HPC, Numerical Analysis, nVidia, nVidia V100, OpenMP, Package, SYCL
Shiwei Zhang, Lansong Diao, Chuan Wu, Zongyan Cao, Siyu Wang, Wei Lin
Tags: Computer science, CUDA, Deep learning, Distributed computing, GPU cluster, nVidia, nVidia A100, nVidia P100, nVidia V100, Package, PyTorch
Foteini Strati, Xianzhe Ma, Ana Klimovic
Aristotle Martin, Geng Liu, William Ladd, Seyong Lee, John Gounley, Jeffrey Vetter, Saumil Patel, Silvio Rizzi, Victor Mateevitsi, Joseph Insley, Amanda Randles
Tags: AMD Radeon Instinct MI250X, ATI, cfd, CUDA, Fluid dynamics, Heterogeneous systems, HIP, nVidia, nVidia A100, nVidia V100, performance portability, SYCL
December 31, 2023 by
hgpuKonstantinos Parasyris, Giorgis Georgakoudis, Esteban Rangel, Ignacio Laguna, Johannes Doerfert
December 18, 2023 by
hgpu