Shixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Bryan M. Wong, Zizhong Chen
Tags: Code generation, Computer science, CUDA, GEMM, Linear Algebra, Matrix multiplication, nVidia, nVidia A100, Package, Performance, Reliability, Tesla T4
Boyuan Zhang, Jiannan Tian, Sheng Di, Xiaodong Yu, Yunhe Feng, Xin Liang, Dingwen Tao, Franck Cappello
Erik A. Träff, Anton Rydahl, Sven Karlsson, Ole Sigmund, Niels Aage
Andrea Montessori, Marco Lauricella, Adriano Tiribocchi, Mihir Durve, Michele La Rocca, Giorgio Amati, Fabio Bonaccorso, Sauro Succi
Vsevolod Livinskii, Dmitry Babokin, John Regehr
Boyuan Zhang, Jiannan Tian, Sheng Di, Xiaodong Yu, Martin Swany, Dingwen Tao, Franck Cappello
Noel Chalmers, Jakub Kurzak, Damon McDougall, Paul T. Bauman
Filip Petrovič, Jiří Filipovič
Tags: Computer science, CUDA, nVidia, nVidia GeForce GTX 1070, nVidia GeForce GTX 680, nVidia GeForce GTX 750, nVidia GeForce RTX 2080 Ti, OpenCL, Package, Performance, Python, Vulkan
Ali TehraniJamsaz, Alok Mishra, Akash Dutta, Abid M. Malik, Barbara Chapman, Ali Jannesari
Yehonatan Fridman, Guy Tamir, Gal Oren
Diandian Gu, Xintong Xie, Gang Huang, Xin Jin, Xuanzhe Liu