Hugh Leather, Chris Cummins
Deepak Narayanan, Keshav Santhanam, Fiodar Kazhamiaka, Amar Phanishayee, Matei Zaharia
Tags: Computer science, CUDA, Deep learning, FPGA, GPU cluster, Heterogeneous systems, nVidia, Optimization, Task scheduling, Tesla K80, Tesla P100, Tesla V100
Cade Brown, Ahmad Abdelfattah, Stanimire Tomov, Jack Dongarra
Tags: AMD, AMD Radeon Instinct MI25, AMD Radeon Instinct Mi50, Benchmarking, Computer science, Heterogeneous systems, HIP, HPC, Linear Algebra, Performance, Portability
Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, Wenguang Chen
Tags: Algorithms, Artificial intelligence, Benchmarking, Computer science, CUDA, Heterogeneous systems, Machine learning, nVidia, nVidia GeForce GTX 1080 Ti, Package, Tesla V100
David J. Lusher, Satya P. Jammy, Neil D. Sandham
Tags: cfd, Code generation, CUDA, Fluid dynamics, GPU cluster, Heterogeneous systems, Numerical simulation, nVidia, OpenCL, OpenMP, OpenMPI, Package, Python, Tesla P100
Gina Yuan, Shoumik Palkar, Deepak Narayanan, Matei Zaharia
Usman Ahmed, Jerry Chun-Wei Lin, Gautam Srivastava, Muhammad Aleem
Sohan Lal, Aksel Alpay, Philip Salzmann, Biagio Cosenza, Alexander Hirsch, Nicolai Stawinoga, Peter Thoman, Thomas Fahringer, Vincent Heuveline
Tags: Benchmarking, Computer science, FPGA, Heterogeneous systems, nVidia, nVidia GeForce GTX Titan X, OpenCL, Package, Performance, PTX, SYCL
Yuanhang Yu, Dong Wen, Ying Zhang, Xiaoyang Wang, Wenjie Zhang, Xuemin Lin