Robert Tjarko Lange, Aaditya Prasad, Qi Sun, Maxence Faldor, Yujin Tang, David Ha
February 24, 2025 by
hgpuRahulkumar Gayatri, Shilei Tian, Stephen Olivier, Johannes Doerfert, Eric Wright
Tags: AMD Radeon Instinct MI250X, ATI, Computer science, CUDA, HIP, MPI, nVidia, nVidia A100, OpenMP, Package, performance portability
February 16, 2025 by
hgpuYafan Huang, Sheng Di, Guanpeng Li, Franck Cappello
February 16, 2025 by
hgpuNicolas Nytko, Andrew Reisner, J. David Moulton, Luke N. Olson, Matthew West
February 16, 2025 by
hgpuYichao Yuan, Advait Iyer, Lin Ma, Nishil Talati
February 16, 2025 by
hgpuTirth Vamja, Kaustabha Ray, Felix George, UmaMaheswari C Devi
Nozal Raúl, Jose Luis Bosque
Tags: Computer science, CUDA, Heterogeneous systems, Hybrid computing, LLVM, load balancing, nVidia, nVidia GeForce GT 1030, oneAPI, OpenCL, performance portability, SYCL
Dahua Feng, Zhiming Xu, Rongxiang Wang, Felix Xiaozhu Lin
Tags: AI, Apple M2 Max, Apple M2 Pro, Apple M2 Ultra, Computer science, CUDA, Linear Algebra, LLM, Machine learning, nVidia, nVidia GeForce RTX 4090, nVidia GeFroce RTX 2080 Ti, nVidia Quadro RTX 4000, nVidia RTX A6000, Performance, PyTorch
Yihao Sun, Sidharth Kumar, Thomas Gilray, Kristopher Micinski