Abhijeet Saraha, Yuanbo Li, Chris Porter, Santosh Pande
September 7, 2025 by hgpu
Xiyan Hu, Titus Parker, Connor Phillips, Yifa Yu
Tags: AMD Radeon Instinct MI210, AMD Radeon Instinct MI325X, ATI, Computer science, HIP, nVidia, nVidia A100, Package, Performance, PyTorch, ROCm, Thesis
Jiarong Xing, Yifan Qiao, Simon Mo, Xingqi Cui, Gur-Eyal Sela, Yang Zhou, Joseph Gonzalez, Ion Stoica
Neha Prakriya, Zijian Ding, Yizhou Sun, Jason Cong
Weijie Lv, Xuan Xia, Sheng-Jun Huang
Dimitar Mileski, Nikola Petrovski, Marjan Gusev
Abhishek Ghosh, Ajay Nayak, Ashish Panwar, Arkaprava Basu
Anne Ouyang, Simon Guo, Simran Arora, Alex L. Zhang, William Hu, Christopher Ré, Azalia Mirhoseini
February 24, 2025 by hgpu
Dahua Feng, Zhiming Xu, Rongxiang Wang, Felix Xiaozhu Lin
Tags: AI, Apple M2 Max, Apple M2 Pro, Apple M2 Ultra, Computer science, CUDA, Linear Algebra, LLM, Machine learning, nVidia, nVidia GeForce RTX 4090, nVidia GeForce RTX 2080 Ti, nVidia Quadro RTX 4000, nVidia RTX A6000, Performance, PyTorch
Ruijun Feng, Hammond Pearce, Pietro Liguori, Yulei Sui
Jiaping Wang, Simiao Zhang, Qiao-Chu He, Yifan Chen
Tags: Benchmarking, Computer science, CUDA, LLM, Machine learning, nVidia, nVidia A100, nVidia RTX A6000, Package, Python, PyTorch