Taesu Kim, Jongho Lee, Daehyun Ahn, Sarang Kim, Jiwoong Choi, Minkyu Kim, Hyungjun Kim
Tags: Computer science, CUDA, Deep learning, Machine learning, Matrix multiplication, Mixed precision, nVidia, nVidia A100, nVidia GeForce RTX 4090, nVidia RTX A6000, Package
February 18, 2024 by
hgpuBasile Lewandowski, Atli Kosson
Shigang Li, Kazuki Osawa, Torsten Hoefler
Moritz Lehmann, Mathias J. Krause, Giorgio Amati, Marcello Sega, Jens Harting, Stephan Gekle
Tags: AMD Radeon Instinct MI100, AMD Radeon VI, ATI, Fluid dynamics, lattice Boltzmann, Mixed precision, nVidia, OpenCL, Tesla K20, Tesla K40, Tesla K80, Tesla P100, Tesla V100
December 19, 2021 by
hgpuJennifer A. Loe, Christian A. Glusa, Ichitaro Yamazaki, Erik G. Boman, Sivasankaran Rajamanickam
September 19, 2021 by
hgpuBinrui Li, Shenggan Cheng, James Lin
Jie (Amy)Yang, Jianyu Huang, Jongsoo Park, Ping Tak Peter Tang, Andrew Tulloch
Thomas Faingnaert, Tim Besard, Bjorn De Sutter
Tags: Computer science, CUBLAS, CUDA, Julia, Machine learning, Mathematical Software, Matrix multiplication, Mixed precision, nVidia, nVidia GeForce RTX 2080 Ti, Package, Performance
Orestis Zachariadis, Nitin Satpute, Juan Gómez-Luna, Joaquín Olivares
Tags: Algorithms, Computer science, CUDA, Matrix multiplication, Mixed precision, nVidia, nVidia GeForce RTX 2070, nVidia Titan RTX, Package, Performance, Sparse matrix