Weihang Gao, Teng Zhao, Yongfa Guo, Jiuyang Liang, Huan Liu, Maoying Luo, Zedong Luo, Wei Qin, Yichao Wang, Qi Zhou, Shi Jin, Zhenli Xu
Jiacheng Yang, Christina Giannoula, Jun Wu, Mostafa Elhoushi, James Gleeson, Gennady Pekhimenko
Tags: Cloud, Computer science, CUDA, Matrix multiplication, nVidia, nVidia GeForce RTX 2070, nVidia GeForce RTX 2080 Ti, nVidia GeForce RTX 3090, Package, Performance, PyTorch, Tesla A100
Anton Rydahl, Joseph Huber, Ethan Luis McDonough, Johannes Doerfert
Tags: AMD Radeon Instinct MI250X, AMD Radeon Instinct Mi50, ATI, Benchmarking, Compilers, Computer science, CUDA, HIP, Numerical Analysis, OpenMP, Performance, Tesla A100, Tesla V100
December 18, 2023 by hgpu
Mingyu Liang, Wenyin Fu, Louis Feng, Zhongyi Lin, Pavani Panakanti, Shengbao Zheng, Srinivas Sridharan, Christina Delimitrou
Tags: AI, Benchmarking, Code generation, Computer science, CUDA, nVidia, Package, Performance, PyTorch, Tesla A100, Tesla V100
Kazuaki Matsumura, Simon Garcia De Gonzalo, Antonio J. Peña
Alok Mishra, Abid M. Malik, Meifeng Lin, Barbara Chapman
Tags: AMD Radeon Instinct Mi50, ATI, Benchmarking, Computer science, Heterogeneous systems, Machine learning, nVidia, nVidia GeForce RTX 2080, OpenMP, Tesla A100, Tesla K80, Tesla V100
Richard Schoonhoven, Bram Veenboer, Ben van Werkhoven, Kees Joost Batenburg
November 20, 2022 by hgpu
André Müller, Bertil Schmidt, Richard Membarth, Roland Leißa, Sebastian Hack
Tags: AMD Radeon Instinct MI100, ATI, Bioinformatics, Biology, CUDA, Next-Generation sequencing, nVidia, nVidia GeForce RTX 3090, OpenCL, Package, Sequence alignment, Tesla A100
Tal Ben-Nun, Linus Groner, Florian Deconinck, Tobias Wicky, Eddie Davis, Johann Dahm, Oliver Elbert, Rhea George, Jeremy McGibbon, Lukas Trümper, Elynn Wu, Oliver Fuhrer, Thomas Schulthess, Torsten Hoefler
Linnan Wang, Chenhan Yu, Satish Salian, Slawomir Kierat, Szymon Migacz, Alex Fit Florea
Dominik Ernst, Markus Holzer, Georg Hager, Matthias Knorr, Gerhard Wellein