Dewei Wang, Wei Zhu, Liyang Ling, Ettore Tiotto, Quintin Wang, Whitney Tsang, Julian Opperman, Jacky Deng
Mengyue Xi, Tianyu Guo, Xuanteng Huang, Zejia Lin, Xianwei Zhang
Richard Schulze, Sergei Gorlatch, Ari Rasch
Pau López Castillón, Xavier Caricchio Hernández, Leonidas Kosmidis
Robert Szafarczyk, Syed Waqar Nabi, Wim Vanderbauwhede
February 10, 2025 by
hgpuManos Pavlidakis, Chris Kitching, Nicholas Tomlinson, Michael Søndergaard
Mary Hall, Cosmin Oancea, Anne C. Elster, Ari Rasch, Sameeran Joshi, Amir Mohammad Tavakkoli, Richard Schulze
Aaron Jarmusch, Felipe Cabarcas, Swaroop Pophale, Andrew Kallai, Johannes Doerfert, Luke Peyralans, Seyong Lee, Joel Denny, Sunita Chandrasekaran
Tags: AMD Radeon Instinct MI210, AMD Radeon Instinct MI250X, ATI, Compilers, Computer science, Fortran, Heterogeneous systems, HPC, nVidia, nVidia H100, OpenMP
Junqing Lin, Jingwei Sun, Xiaolong Shi, Honghe Zhang, Xianzhi Yu, Xinzhi Wang, Jun Yao, Guangzhong Sun
Tags: Compilers, Computer science, CUDA, Deep learning, Linear Algebra, Matrix multiplication, Neural networks, nVidia, nVidia GeForce RTX 2080 Ti, Performance, Sparse matrix, Tesla V100