Shiwei Zhang, Lansong Diao, Chuan Wu, Zongyan Cao, Siyu Wang, Wei Lin
Tags: Computer science, CUDA, Deep learning, Distributed computing, GPU cluster, nVidia, nVidia A100, nVidia P100, nVidia V100, Package, PyTorch
John Jacobson, Martin Burtscher, Ganesh Gopalakrishnan
Wei-Chen Lin, Simon McIntosh-Smith, Tom Deakin
Foteini Strati, Xianzhe Ma, Ana Klimovic
Ashwina Kumar, M. Venkata Krishna, Prasanna Bartakke, Rahul Kumar, Rajesh Pandian M, Nibedita Behera, Rupesh Nasre
Tags: Code generation, Computer science, CUDA, DSL, nVidia, nVidia GeForce RTX 2080 Ti, OpenACC, OpenCL, Package, SYCL, Tesla V100
Tal Kadosh, Niranjan Hasabnis, Vy A. Vo, Nadav Schneider, Neva Krien, Mihai Capota, Abdul Wasay, Nesreen Ahmed, Ted Willke, Guy Tamir, Yuval Pinter, Timothy Mattson, Gal Oren
Tom T.P. Franken, Thomas Neele, Jan Friso Groote
Biyao Che, Zixiao Wang, Ying Chen, Liang Guo, Yuan Liu, Yuan Tian, Jizhuang Zhao
Mohammad Zubair, Aaron Walden, Gabriel Nastac, Eric Nielsen, Christoph Bauinger, Xiao Zhu
December 31, 2023 by
hgpuAristotle Martin, Geng Liu, William Ladd, Seyong Lee, John Gounley, Jeffrey Vetter, Saumil Patel, Silvio Rizzi, Victor Mateevitsi, Joseph Insley, Amanda Randles
Tags: AMD Radeon Instinct MI250X, ATI, cfd, CUDA, Fluid dynamics, Heterogeneous systems, HIP, nVidia, nVidia A100, nVidia V100, performance portability, SYCL
December 31, 2023 by
hgpu