Stefano Corda, Bram Veenboer, Emma Tolley
Hanqiu Chen, Yahya Alhinai, Yihan Jiang, Eunjee Na, Cong Hao
Zhibo Liu, Yuanyuan Yuan, Shuai Wang, Xiaofei Xie, Lei Ma
Richard Schoonhoven, Ben van Werkhoven, Kees Joost Batenburg
Tags: AMD Radeon Instinct Mi50, ATI, Auto-Tuning, Benchmarking, Computer science, CUDA, nVidia, nVidia A100, nVidia GeForce GTX 1080 Ti, nVidia GeForce GTX Titan X, nVidia Titan RTX, OpenCL, Performance, pyCUDA, PyOpenCL, Tesla K20, Tesla P100, Tesla V100
Tao Lu, Chengkun Wei, Ruijing Yu, Yi Chen, Li Wang, Chaochao Chen, Zeke Wang, and Wenzhi Chen
Jianjing An, Dezheng Zhang, Ke Xu, Dong Wang
Tags: Computational Complexity, Computer science, Deep learning, Design space exploration, FPGA, Neural networks, nVidia, OpenCL, Package, RNN, Tesla K40
Shigang Li, Kazuki Osawa, Torsten Hoefler
Jieyang Chen, Chenhao Xie, Jesun S Firoz, Jiajia Li, Shuaiwen Leon Song, Kevin Barker, Mark Raugas, Ang Li
Wael Elwasif, Sergei Bastrakov, Spencer H. Bryngelson, Michael Bussmann, Sunita Chandrasekaran, Florina Ciorba, M. A. Clark, Alexander Debus, William Godoy, Nick Hagerty, Jeff Hammond, David Hardy, J. Austin Harris, Oscar Hernandez, Balint Joo, Sebastian Keller, Paul Kent, Henry Le Berre, Damien Lebrun-Grandie, Elijah MacCarthy, Verónica G. Melesse Vergara, Bronson Messer, Ross Miller, Sarp Oral, Jean-Guillaume Piccinali, Anand Radhakrishnan, Osman Simsek, Filippo Spiga, Klaus Steiniger, Jan Stephan, John E. Stone, Christian Trott, René Widera, Jeffrey Young
Tags: ARM, Benchmarking, cfd, Computer science, CUDA, Fluid dynamics, Fortran, HPC, MPI, nVidia, nVidia A100, OpenACC, Package