Tirth Vamja, Kaustabha Ray, Felix George, UmaMaheswari C Devi
Weile Luo, Ruibo Fan, Zeyu Li, Dayou Du, Hongyuan Liu, Qiang Wang, Xiaowen Chu
Csaba Tóth, Danilo Jr Dela Cruz, Harald Oberhauser
Jonah Ekelund, Stefano Markidis, Ivy Peng
Jiaping Wang, Simiao Zhang, Qiao-Chu He, Yifan Chen
Tags: Benchmarking, Computer science, CUDA, LLM, Machine learning, nVidia, nVidia A100, nVidia RTX A6000, Package, Python, PyTorch
Yunjae Lee, Juntaek Lim, Jehyeon Bang, Eunyeong Cho, Huijong Jeong, Taesu Kim, Hyungjun Kim, Joonhyung Lee, Jinseop Im, Ranggi Hwang, Se Jung Kwon, Dongsoo Lee, Minsoo Rhu
Lourens van Niekerk, Dhiraj Kumar, Aasish Kumar Sharma, Tino Meisel, Martin Leandro Paleico, Christian Boehme
Gregor Daiß, Patrick Diehl, Jiakun Yan, John K. Holmen, Rahulkumar Gayatri, Christoph Junghans, Alexander Straub, Jeff R. Hammond, Dominic Marcello, Miwako Tsuji, Dirk Pflüger, Hartmut Kaiser
Tags: AMD Radeon Instinct MI100, AMD Radeon Instinct MI250X, Astrophysics, ATI, Computer science, CUDA, Heterogeneous systems, HIP, HPC, nVidia, nVidia A100, Package, performance portability, Physics
December 29, 2024 by hgpu
Kristoffer August Kortbæk, Rune Ejnar Bang Lejbølle
December 24, 2024 by hgpu
Aman Chaturvedi, Daniel Nichols, Siddharth Singh, Abhinav Bhatele
Tags: Code generation, Computer science, CUDA, HIP, HPC, LLM, MPI, nVidia, nVidia A100, OpenMP, Package
December 24, 2024 by hgpu