Yuhao Zhou, Peng Jia, Jiayong Liu, Ximing Fan
Chris S. Lin, Joyce Qu, Gururaj Saileshwar
Zicong Ye, Kunming Zhang, Guoming Tang
Kunming Zhang, Hanlong Liao, Guoming Tang
Justus Henneberg, Felix Schuhknecht
Abhishek Ghosh, Ajay Nayak, Ashish Panwar, Arkaprava Basu
Rodrigo Huerta, Mojtaba Abaie Shoushtary, José-Lorenzo Cruz, Antonio González
Mohammad Atif, Tianle Wang, Zhihua Dong, Charles Leggett, Meifeng Lin
Radostin Stoyanov, Viktória Spišaková, Jesus Ramos, Steven Gurfinkel, Andrei Vagin, Adrian Reber, Wesley Armour, Rodrigo Bruno
Tags: AMD Radeon Instinct MI210, ATI, Computer science, CUDA, Deep learning, nVidia, nVidia A100, nVidia H100, nVidia RTX A6000, Package, ROCm
Youhe Jiang, Fangcheng Fu, Xiaozhe Yao, Guoliang He, Xupeng Miao, Ana Klimovic, Bin Cui, Binhang Yuan, Eiko Yoneki
February 10, 2025 by hgpu
Dahua Feng, Zhiming Xu, Rongxiang Wang, Felix Xiaozhu Lin
Tags: AI, Apple M2 Max, Apple M2 Pro, Apple M2 Ultra, Computer science, CUDA, Linear Algebra, LLM, Machine learning, nVidia, nVidia GeForce RTX 4090, nVidia GeForce RTX 2080 Ti, nVidia Quadro RTX 4000, nVidia RTX A6000, Performance, PyTorch
Jiaping Wang, Simiao Zhang, Qiao-Chu He, Yifan Chen
Tags: Benchmarking, Computer science, CUDA, LLM, Machine learning, nVidia, nVidia A100, nVidia RTX A6000, Package, Python, PyTorch