Giang Nguyen, Johir Islam, Rangeet Pan, Hridesh Rajan
December 12, 2021 by
hgpuZane Fink, Simeng Liu, Jaemin Choi, Matthias Diener, Laxmikant V. Kale
Tags: Benchmarking, Computer science, CUDA, Distributed computing, HPC, Machine learning, MPI, nVidia, Package, Performance, Python, Tesla V100
November 14, 2021 by
hgpuYao-Yuan Yang, Moto Hira, Zhaoheng Ni, Anjali Chourdia, Artyom Astafurov, Caroline Chen, Ching-Feng Yeh, Christian Puhrsch, David Pollack, Dmitriy Genzel, Donny Greenberg, Edward Z. Yang, Jason Lian, Jay Mahadeokar, Jeff Hwang, Ji Chen, Peter Goldsborough, Prabhat Roy, Sean Narenthiran, Shinji Watanabe, Soumith Chintala, Vincent Quenneville-Bélair, Yangyang Shi
Nawras Alnaasan, Arpan Jain, Aamir Shafi, Hari Subramoni, Dhabaleswar K Panda
Tian Lan, Sunil Srinivasa, Stephan Zheng
September 5, 2021 by
hgpuK.I. Mihajlenko, M.A. Lukin, A.S. Stankevich
Alexandros Nikolaos Ziogas, Timo Schneider, Tal Ben-Nun, Alexandru Calotoiu, Tiziano De Matteis, Johannes de Fine Licht, Luca Lavarini, Torsten Hoefler
Boyuan Feng, Yuke Wang, Tong Geng, Ang Li, Yufei Ding
Tags: Algorithms, Computer science, CUDA, Deep learning, Neural networks, nVidia, nVidia GeForce RTX 3090, Package, Precision, Python, Tesla A100
Alexandros Nikolaos Ziogas, Tal Ben-Nun, Timo Schneider, Torsten Hoefler
B. Boys, T. J. Dodwell, M. Hobbs, M. Girolami