Jinfan Chen, Shigang Li, Ran Gun, Jinhui Yuan, Torsten Hoefler
Sridutt Bhalachandra, Brian Austin, Samuel Williams, Nicholas J. Wright
December 25, 2022 by hgpu
Muhammad Osama
Tags: Algorithms, Computer science, CUDA, Linear Algebra, load balancing, Matrix multiplication, nVidia, nVidia A100, Package, Sparse, Thesis
December 25, 2022 by hgpu
René Caspart, Sebastian Ziegler, Arvid Weyrauch, Holger Obermaier, Simon Raffeiner, Leon Pascal Schuhmacher, Jan Scholtyssek, Darya Trofimova, Marco Nolden, Ines Reinartz, Fabian Isensee, Markus Götz, Charlotte Debus
December 11, 2022 by hgpu
Philipp A. Witte, Russell J. Hewett, Kumar Saurabh, AmirHossein Sojoodi, Ranveer Chandra
Tags: AI, Cloud, Computational Physics, Computer science, Deep learning, Differential equations, Neural networks, nVidia, nVidia A100, nVidia DGX-A100, Partial differential equations, PDEs
November 27, 2022 by hgpu
Guyue Huang, Yang Bai, Liu Liu, Yuke Wang, Bei Yu, Yufei Ding, Yuan Xie
Connor Kenyon, Collin Capano
Xiangyang Ju, Yunsong Wang, Daniel Murnane, Nicholas Choma, Steven Farrell, Paolo Calafiura
Tags: Artificial intelligence, Benchmarking, Computer science, CUDA, Deep learning, Machine learning, Neural networks, nVidia, nVidia A100, Pattern recognition, Tesla V100, TPU
Yu-Hsiang M. Tsai, Terry Cojean, Hartwig Anzt
Tags: AMD Radeon Instinct MI100, ATI, Computer science, CUDA, Linear Algebra, nVidia, nVidia A100, OpenCL, Package, performance portability, Sparse, SYCL
Gregor Daiß, Patrick Diehl, Dominic Marcello, Alireza Kheirkhahan, Hartmut Kaiser, Dirk Pflüger
Richard Schoonhoven, Ben van Werkhoven, Kees Joost Batenburg
Tags: AMD Radeon Instinct MI50, ATI, Auto-Tuning, Benchmarking, Computer science, CUDA, nVidia, nVidia A100, nVidia GeForce GTX 1080 Ti, nVidia GeForce GTX Titan X, nVidia Titan RTX, OpenCL, Performance, pyCUDA, PyOpenCL, Tesla K20, Tesla P100, Tesla V100