Weile Luo, Ruibo Fan, Zeyu Li, Dayou Du, Qiang Wang, Xiaowen Chu
Tags: Artificial intelligence, Benchmarking, Computer science, CUDA, Deep learning, nVidia, nVidia A100, nVidia GeForce RTX 4090, nVidia H800, Performance, PTX
February 25, 2024 by
hgpuJoshua H. Davis, Pranav Sivaraman, Isaac Minn, Konstantinos Parasyris, Harshitha Menon, Giorgis Georgakoudis, Abhinav Bhatele
Tags: AMD Radeon Instinct MI250X, AMD Radeon Instinct Mi50, ATI, Computer science, CUDA, Heterogeneous systems, HIP, MPI, nVidia, nVidia V100, OpenACC, OpenMP, Performance, performance portability, SYCL
February 18, 2024 by
hgpuRuben Laso, Diego Krupitza, Sascha Hunold
February 18, 2024 by
hgpuDimitrios Danopoulos, Georgios Zervakis, Dimitrios Soudris, Jörg Henkel
February 18, 2024 by
hgpuJonathan Strobl, Leonardo Solis-Vasquez, Yannick Lavan, Andreas Koch
February 18, 2024 by
hgpuTaesu Kim, Jongho Lee, Daehyun Ahn, Sarang Kim, Jiwoong Choi, Minkyu Kim, Hyungjun Kim
Tags: Computer science, CUDA, Deep learning, Machine learning, Matrix multiplication, Mixed precision, nVidia, nVidia A100, nVidia GeForce RTX 4090, nVidia RTX A6000, Package
February 18, 2024 by
hgpuOmer Dunay, Daniel Cheng, Adam Tait, Parth Thakkar, Peter C Rigby, Andy Chiu, Imad Ahmad, Arun Ganesan, Chandra Maddila, Vijayaraghavan Murali, Ali Tayyebi, Nachiappan Nagappan
February 12, 2024 by
hgpuDaya Guo, Qihao Zhu, Dejian Yang, Zhenda Xie, Kai Dong, Wentao Zhang, Guanting Chen, Xiao Bi, Y. Wu, Y.K. Li, Fuli Luo, Yingfei Xiong, Wenfeng Liang
February 12, 2024 by
hgpuChengyi Nie, Jessica Maghakian, Zhenhua Liu
February 12, 2024 by
hgpuGianmarco Accordi, Davide Gadioli, Emanele Vitali, Luigi Crisci, Biagio Cosenza, Andrea Beccari, Gianluca Palermo
February 12, 2024 by
hgpuHunter McCoy, Prashant Pandey