Burkhard Ringlein, Thomas Parnell, Radu Stoica
Tags: AMD Radeon Instinct MI250, ATI, Auto-Tuning, Computer science, CUDA, DSL, HIP, LLM, nVidia, nVidia A100, Performance, performance portability
Neha Prakriya, Zijian Ding, Yizhou Sun, Jason Cong
Aashaka Shah, Abhinav Jangda, Binyang Li, Caio Rocha, Changho Hwang, Jithin Jose, Madan Musuvathi, Olli Saarikivi, Peng Cheng, Qinghua Zhou, Roshan Dathathri, Saeed Maleki, Ziyue Yang
Tags: AI, AMD Radeon Instinct MI300X, ATI, Computer science, CUDA, Heterogeneous systems, HIP, nVidia, nVidia A100, nVidia H100, Package
Radostin Stoyanov, Viktória Spišaková, Jesus Ramos, Steven Gurfinkel, Andrei Vagin, Adrian Reber, Wesley Armour, Rodrigo Bruno
Tags: AMD Radeon Instinct MI210, ATI, Computer science, CUDA, Deep learning, nVidia, nVidia A100, nVidia H100, nVidia RTX A6000, Package, ROCm
Rahulkumar Gayatri, Shilei Tian, Stephen Olivier, Johannes Doerfert, Eric Wright
Tags: AMD Radeon Instinct MI250X, ATI, Computer science, CUDA, HIP, MPI, nVidia, nVidia A100, OpenMP, Package, performance portability
February 16, 2025 by hgpu
Yichao Yuan, Advait Iyer, Lin Ma, Nishil Talati
February 16, 2025 by hgpu
Manos Pavlidakis, Chris Kitching, Nicholas Tomlinson, Michael Søndergaard
Gregor Daiß, Patrick Diehl, Jiakun Yan, John K. Holmen, Rahulkumar Gayatri, Christoph Junghans, Alexander Straub, Jeff R. Hammond, Dominic Marcello, Miwako Tsuji, Dirk Pflüger, Hartmut Kaiser
Tags: AMD Radeon Instinct MI100, AMD Radeon Instinct MI250X, Astrophysics, ATI, Computer science, CUDA, Heterogeneous systems, HIP, HPC, nVidia, nVidia A100, Package, performance portability, Physics
December 29, 2024 by hgpu
Manuel Costanzo, Enzo Rucci, Carlos García-Sánchez, Marcelo Naiouf, Manuel Prieto-Matías
Tags: AMD Radeon RX 6700 XT, AMD Radeon RX Vega 6, ATI, Bioinformatics, Biology, Computer science, CUDA, Databases, Heterogeneous systems, HPC, Intel, Intel Arc A770, Intel UHD 630, Intel UHD 770, nVidia, nVidia GeForce GTX 1080, nVidia GeForce GTX 980, nVidia GeForce RTX 2070, nVidia GeForce RTX 3070, nVidia GeForce RTX 3090, oneAPI, Package, performance portability, SYCL, Tesla V100
December 15, 2024 by hgpu
Yohei Miki, Toshihiro Hanawa
Tags: AMD Radeon Instinct MI210, ATI, Computer science, Diffusion equation, Intel, Intel Ponte Vecchio Max 1100, N-body simulation, nVidia, nVidia GH200, nVidia H100, OpenACC, OpenMP, Package
Sanil Rao, Mike Franusich, Mohammad Alaul Haque Monil, Het Mankad, Jeffrey S. Vetter, Franz Franchetti
Tags: AMD, ATI, Code generation, Computer science, CUDA, Differential equations, Fortran, Heterogeneous systems, HIP, nVidia, OpenCL, OpenMP, Partial differential equations, PDEs