Yafan Huang, Sheng Di, Guanpeng Li, Franck Cappello
February 16, 2025 by
hgpuRahulkumar Gayatri, Shilei Tian, Stephen Olivier, Johannes Doerfert, Eric Wright
Tags: AMD Radeon Instinct MI250X, ATI, Computer science, CUDA, HIP, MPI, nVidia, nVidia A100, OpenMP, Package, performance portability
February 16, 2025 by
hgpuNicolas Nytko, Andrew Reisner, J. David Moulton, Luke N. Olson, Matthew West
February 16, 2025 by
hgpuTirth Vamja, Kaustabha Ray, Felix George, UmaMaheswari C Devi
Weile Luo, Ruibo Fan, Zeyu Li, Dayou Du, Hongyuan Liu, Qiang Wang, Xiaowen Chu
Csaba Tóth, Danilo Jr Dela Cruz, Harald Oberhauser
Jonah Ekelund, Stefano Markidis, Ivy Peng
Jiaping Wang, Simiao Zhang, Qiao-Chu He, Yifan Chen
Tags: Benchmarking, Computer science, CUDA, LLM, Machine learning, nVidia, nVidia A100, nVidia RTX A6000, Package, Python, PyTorch
Yunjae Lee, Juntaek Lim, Jehyeon Bang, Eunyeong Cho, Huijong Jeong, Taesu Kim, Hyungjun Kim, Joonhyung Lee, Jinseop Im, Ranggi Hwang, Se Jung Kwon, Dongsoo Lee, Minsoo Rhu
Lourens van Niekerk, Dhiraj Kumar, Aasish Kumar Sharma, Tino Meisel, Martin Leandro Paleico, Christian Boehme