Richard Schoonhoven, Ben van Werkhoven, Kees Joost Batenburg
Tags: AMD Radeon Instinct Mi50, ATI, Auto-Tuning, Benchmarking, Computer science, CUDA, nVidia, nVidia A100, nVidia GeForce GTX 1080 Ti, nVidia GeForce GTX Titan X, nVidia Titan RTX, OpenCL, Performance, pyCUDA, PyOpenCL, Tesla K20, Tesla P100, Tesla V100
Tao Lu, Chengkun Wei, Ruijing Yu, Yi Chen, Li Wang, Chaochao Chen, Zeke Wang, and Wenzhi Chen
Jianjing An, Dezheng Zhang, Ke Xu, Dong Wang
Tags: Computational Complexity, Computer science, Deep learning, Design space exploration, FPGA, Neural networks, nVidia, OpenCL, Package, RNN, Tesla K40
Shigang Li, Kazuki Osawa, Torsten Hoefler
Jieyang Chen, Chenhao Xie, Jesun S Firoz, Jiajia Li, Shuaiwen Leon Song, Kevin Barker, Mark Raugas, Ang Li
Wael Elwasif, Sergei Bastrakov, Spencer H. Bryngelson, Michael Bussmann, Sunita Chandrasekaran, Florina Ciorba, M. A. Clark, Alexander Debus, William Godoy, Nick Hagerty, Jeff Hammond, David Hardy, J. Austin Harris, Oscar Hernandez, Balint Joo, Sebastian Keller, Paul Kent, Henry Le Berre, Damien Lebrun-Grandie, Elijah MacCarthy, Verónica G. Melesse Vergara, Bronson Messer, Ross Miller, Sarp Oral, Jean-Guillaume Piccinali, Anand Radhakrishnan, Osman Simsek, Filippo Spiga, Klaus Steiniger, Jan Stephan, John E. Stone, Christian Trott, René Widera, Jeffrey Young
Tags: ARM, Benchmarking, cfd, Computer science, CUDA, Fluid dynamics, Fortran, HPC, MPI, nVidia, nVidia A100, OpenACC, Package
Shilei Tian, Joseph Huber, Konstantinos Parasyris, Barbara Chapman, Johannes Doefert
September 11, 2022 by
hgpuJiangsu Du, Ziming Liu, Jiarui Fang, Shenggui Li, Yongbin Li, Yutong Lu, Yang You
September 11, 2022 by
hgpuGenghan Zhang, Yuetong Zhao, Yanting Tao, Zhongming Yu, Guohao Dai, Sitao Huang, Yuan Wen, Pavlos Petoumenos, Yu Wang
September 11, 2022 by
hgpuRe'em Harel, Matan Rusanovsky, Ron Wagner, Harel Levin, Gal Oren
September 11, 2022 by
hgpu