Jiaqi Lv, Xufeng He, Yanchen Liu, Xu Dai, Yang Hu, Shouyi Yin
Tags: AI, Benchmarking, Compilers, Computer science, CUDA, Deep learning, LLM, nVidia, nVidia A100, Package, performance portability
Dewei Wang, Wei Zhu, Liyang Ling, Ettore Tiotto, Quintin Wang, Whitney Tsang, Julian Opperman, Jacky Deng
Mengyue Xi, Tianyu Guo, Xuanteng Huang, Zejia Lin, Xianwei Zhang
Richard Schulze, Sergei Gorlatch, Ari Rasch
Pau López Castillón, Xavier Caricchio Hernández, Leonidas Kosmidis
Robert Szafarczyk, Syed Waqar Nabi, Wim Vanderbauwhede
February 10, 2025 by
hgpuManos Pavlidakis, Chris Kitching, Nicholas Tomlinson, Michael Søndergaard
Mary Hall, Cosmin Oancea, Anne C. Elster, Ari Rasch, Sameeran Joshi, Amir Mohammad Tavakkoli, Richard Schulze
Aaron Jarmusch, Felipe Cabarcas, Swaroop Pophale, Andrew Kallai, Johannes Doerfert, Luke Peyralans, Seyong Lee, Joel Denny, Sunita Chandrasekaran
Tags: AMD Radeon Instinct MI210, AMD Radeon Instinct MI250X, ATI, Compilers, Computer science, Fortran, Heterogeneous systems, HPC, nVidia, nVidia H100, OpenMP