Keichi Takahashi, Wassapon Watanakeesuntorn, Kohei Ichikawa, Joseph Park, Ryousei Takano, Jason Haga, George Sugihara, Gerald M. Pao
Alexandros Nikolaos Ziogas, Tal Ben-Nun, Timo Schneider, Torsten Hoefler
Wenshuo Li, Hanting Chen, Mingqiang Huang, Xinghao Chen, Chunjing Xu, Yunhe Wang
Hamid Tabani, Fabio Mazzocchetti, Pedro Benedicte, Jaume Abella, Francisco J. Cazorla
Meriam Dhouibi, Ahmed Karim Ben Salem, Afef Saidi, Slim Ben Saoud
Xiaoyan Liu, Yi Liu, Ming Dun, Bohong Yin, Hailong Yang, Zhongzhi Luan, Depei Qian
Yuhsiang M. Tsai, Terry Cojean, Hartwig Anzt
Tags: AMD Radeon VII, ATI, Benchmarking, Computer science, HIP, Linear Algebra, nVidia, Package, Performance, Sparse, Sparse matrix, Tesla V100
Riyadh Baghdadi, Massinissa Merouani, Mohamed-Hicham Leghettas, Kamel Abdous, Taha Arbaoui, Karima Benatchba, Saman Amarasinghe
Jiří Filipovič, Jana Hozzová, Amin Nezarat, Jaroslav Oľha, Filip Petrovič
Tags: Auto-Tuning, Computer science, CUDA, HPC, Machine learning, nVidia, nVidia GeForce GTX 1070, nVidia GeForce RTX 2080, OpenCL, Package, Performance, Vulkan
February 23, 2021 by
hgpuGeoffrey X. Yu, Yubo Gao, Pavel Golikov, Gennady Pekhimenko
Tags: Computer science, CUDA, Deep learning, Machine learning, Neural networks, nVidia, nVidia GeForce RTX 2070, Performance, Python, Tesla P100, Tesla V100