Jiashen Cao, Rathijit Sen, Matteo Interlandi, Joy Arulraj, Hyesoon Kim
Kazuaki Matsumura, Simon Garcia De Gonzalo, Antonio J. Peña
Tags: Benchmarking, Code generation, Computer science, CUDA, nVidia, nVidia GeForce GTX Titan X, OpenACC, Package, PTX, Tesla K40, Tesla K80, Tesla V100
Seongyeon Park, Hajin Kim, Tanveer Ahmad, Nauman Ahmed, Zaid Al-Ars, H. Peter Hofstee, Youngsok Kim, Jinho Lee
Kun Wu, Mert Hidayetoğlu, Xiang Song, Sitao Huang, Da Zheng, Israt Nisa, Wen-mei Hwu
Jinfan Chen, Shigang Li, Ran Gun, Jinhui Yuan, Torsten Hoefler
Pablo F. Zubieta Rico, Ludwig Schneider, Gustavo Perez-Lemus, Riccardo Alessandri, Siva Dasetty, Cintia A. Menéndez, Yiheng Wu, Yezhi Jin, Trung Nguyen, John Parker, Andrew L. Ferguson, Juan J. de Pablo
Muhammad Osama, Serban D. Porumbescu, John D. Owens
Manos Pavlidakis, Stelios Mavridis, Antony Chazapis, Giorgos Vasiliadis, Angelos Bilas
Anna Fortenberry, Stanimire Tomov
December 25, 2022 by
hgpuMuhammad Osama
Tags: Algorithms, Computer science, CUDA, Linear Algebra, load balancing, Matrix multiplication, nVidia, nVidia A100, Package, Sparse, Thesis
December 25, 2022 by
hgpu