Markus Holzer, Martin Bauer, Harald Kostler, Ulrich Rüde
Tags: Algorithms, cfd, Code generation, CUDA, Fluid dynamics, Lattice Boltzmann model, MPI, nVidia, OpenCL, Package, Physics, Programming techniques, Tesla P100, Tesla V100
December 20, 2020 by
hgpuBeau Johnston, Jeffrey S. Vetter, Josh Milthorpe
Tags: AMD Radeon VII, ATI, Benchmarking, Computer science, CUDA, Heterogeneous systems, HIP, Matrix multiplication, nVidia, OpenCL, Package, Performance, Tesla P100
November 29, 2020 by
hgpuJohn Brennan, Stephen Bonner, Amir Atapour-Abarghouei, Philip T Jackson, Boguslaw Obara, Andrew Stephen McGough
Tags: CNN, Computer science, CUDA, Deep learning, Machine learning, Neural networks, nVidia, nVidia Titan RTX, PyTorch, Tesla P100, Tesla V100
Aashaka Shah, Chao-Yuan Wu, Jayashree Mohan, Vijay Chidambaram, Philipp Krähenbühl
Supun Nakandala Karla Saur, Gyeong-In Yu, Konstantinos Karanasos, Carlo Curino, Markus Weimer, Matteo Interlandi
Mawussi Zounon, Nicholas J. Higham, Craig Lucas, Françoise Tisseur
September 27, 2020 by
hgpuRyuichi Sai, John Mellor-Crummey, Xiaozhu Meng, Mauricio Araya-Polo, Jie Meng
September 13, 2020 by
hgpuDeepak Narayanan, Keshav Santhanam, Fiodar Kazhamiaka, Amar Phanishayee, Matei Zaharia
Tags: Computer science, CUDA, Deep learning, FPGA, GPU cluster, Heterogeneous systems, nVidia, Optimization, Task scheduling, Tesla K80, Tesla P100, Tesla V100
Aditya Agarwal, Yupeng Han, Maxim Likhachev
David J. Lusher, Satya P. Jammy, Neil D. Sandham
Tags: cfd, Code generation, CUDA, Fluid dynamics, GPU cluster, Heterogeneous systems, Numerical simulation, nVidia, OpenCL, OpenMP, OpenMPI, Package, Python, Tesla P100