Tirth Vamja, Kaustabha Ray, Felix George, UmaMaheswari C Devi
Nozal Raúl, Jose Luis Bosque
Tags: Computer science, CUDA, Heterogeneous systems, Hybrid computing, LLVM, load balancing, nVidia, nVidia GeForce GT 1030, oneAPI, OpenCL, performance portability, SYCL
Dahua Feng, Zhiming Xu, Rongxiang Wang, Felix Xiaozhu Lin
Tags: AI, Apple M2 Max, Apple M2 Pro, Apple M2 Ultra, Computer science, CUDA, Linear Algebra, LLM, Machine learning, nVidia, nVidia GeForce RTX 4090, nVidia GeFroce RTX 2080 Ti, nVidia Quadro RTX 4000, nVidia RTX A6000, Performance, PyTorch
Yihao Sun, Sidharth Kumar, Thomas Gilray, Kristopher Micinski
Weile Luo, Ruibo Fan, Zeyu Li, Dayou Du, Hongyuan Liu, Qiang Wang, Xiaowen Chu
Dinei A. Rockenbach, Gabriell Araujo, Dalvan Griebler, Luiz Gustavo Fernandes
Csaba Tóth, Danilo Jr Dela Cruz, Harald Oberhauser