John Lawson, Mehdi Goli, Duncan McBain, Daniel Soutar, Louis Sugy
Tags: AMD R9 Nano, ATI, BLAS, Computer science, Deep learning, Linear Algebra, Machine learning, Mathematical Software, OpenCL, Package, Performance, performance portability, SYCL
Paul Sathre, Mark Gardner, Wu-chun Feng
Tags: AMD FirePro S9150, ATI, Computer science, CUDA, FPGA, Intel Xeon Phi, nVidia, OpenCL, Package, performance portability, Tesla K80
Ada Sedova, Andreas Tillack, Arnold Tharrington
David Pfander, Gregor Daiss, Dirk Pfluger
Tags: Clustering, Computer science, Data mining, Distributed computing, Heterogeneous systems, Machine learning, MPI, nVidia, OpenCL, Package, performance portability, Tesla P100
February 10, 2019 by hgpu
Tuowen Zhao, Samuel Williams, Mary Hall, Hans Johansen
December 16, 2018 by hgpu
Abigail Hsu, David Neill Asanza, Joseph A. Schoonover, Zach Jibben, Neil N. Carlson, Robert Robey
Beau Johnston, Greg Falzon, Josh Milthorpe
Tags: AMD FirePro S9150, AMD Radeon R9 290X, AMD Radeon R9 295X2, AMD Radeon RX 480, ATI, ATI Radeon HD 7970, Computer science, Heterogeneous systems, Intel Xeon Phi, nVidia, nVidia GeForce GTX 1080, nVidia GeForce GTX 1080 Ti, OpenCL, Package, performance portability, Tesla K20, Tesla K40
Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Meghan Cowan, Haichen Shen, Leyuan Wang, Yuwei Hu, Luis Ceze, Carlos Guestrin, Arvind Krishnamurthy
Swen Boehm, Swaroop Pophale, Veronica G. Vergara Larrea, Oscar Hernandez
Tags: Benchmarking, Computer science, Heterogeneous systems, Intel Xeon Phi, nVidia, OpenACC, OpenCL, OpenMP, performance portability, Tesla K20, Tesla V100
September 23, 2018 by hgpu
Raul Nozal, Jose Luis Bosque, Ramon Beivide
Maria Kotsifakou, Prakalp Srivastava, Matthew D. Sinclair, Rakesh Komuravelli, Vikram Adve, Sarita Adve