Basile Lewandowski, Atli Kosson
Shigang Li, Kazuki Osawa, Torsten Hoefler
Moritz Lehmann, Mathias J. Krause, Giorgio Amati, Marcello Sega, Jens Harting, Stephan Gekle
Tags: AMD Radeon Instinct MI100, AMD Radeon VI, ATI, Fluid dynamics, lattice Boltzmann, Mixed precision, nVidia, OpenCL, Tesla K20, Tesla K40, Tesla K80, Tesla P100, Tesla V100
December 19, 2021 by
hgpuJennifer A. Loe, Christian A. Glusa, Ichitaro Yamazaki, Erik G. Boman, Sivasankaran Rajamanickam
September 19, 2021 by
hgpuBinrui Li, Shenggan Cheng, James Lin
Jie (Amy)Yang, Jianyu Huang, Jongsoo Park, Ping Tak Peter Tang, Andrew Tulloch
Thomas Faingnaert, Tim Besard, Bjorn De Sutter
Tags: Computer science, CUBLAS, CUDA, Julia, Machine learning, Mathematical Software, Matrix multiplication, Mixed precision, nVidia, nVidia GeForce RTX 2080 Ti, Package, Performance
Orestis Zachariadis, Nitin Satpute, Juan Gómez-Luna, Joaquín Olivares
Tags: Algorithms, Computer science, CUDA, Matrix multiplication, Mixed precision, nVidia, nVidia GeForce RTX 2070, nVidia Titan RTX, Package, Performance, Sparse matrix
Mawussi Zounon, Nicholas J. Higham, Craig Lucas, Françoise Tisseur
September 27, 2020 by
hgpuWeile Jia, Han Wang, Mohan Chen, Denghui Lu, Jiduan Liu, Lin Lin, Roberto Car, Weinan E, Linfeng Zhang
Tags: Computational Physics, CUDA, HPC, Machine learning, Mixed precision, Molecular dynamics, Molecular simulation, MPI, nVidia, OpenMP, Physics, Tesla V100