Tim Lühnen, Tobias Marschner, Sohan Lal
Haining Tong, Natalia Gavrilenko, Hernán Ponce de León, Keijo Heljanko
Xinyi Li, Ang Li, Bo Fang, Katarzyna Swirydowicz, Ignacio Laguna, Ganesh Gopalakrishnan
Tags: AMD Radeon Instinct MI100, AMD Radeon Instinct MI250X, ATI, Computer science, Hardware Architecture, HPC, Matrix multiplication, nVidia, nVidia A100, nVidia H100, nVidia V100, PTX
Hojin Choi, SeongJun Choi, SeogChung Seo
Weile Luo, Ruibo Fan, Zeyu Li, Dayou Du, Qiang Wang, Xiaowen Chu
Tags: Artificial intelligence, Benchmarking, Computer science, CUDA, Deep learning, nVidia, nVidia A100, nVidia GeForce RTX 4090, nVidia H800, Performance, PTX
February 25, 2024 by
hgpuFernando Fernandes dos Santos, Luigi Carro, Flavio Vella, Paolo Rech
Gargi Alavani, Santonu Sarkar
Tags: Computer science, CUDA, Energy-efficient computing, Java, Machine learning, nVidia, Package, Performance, PTX, Tesla K20, Thesis
Viktor Franzén, Carl Östling
Kazuaki Matsumura, Simon Garcia De Gonzalo, Antonio J. Peña
Tags: Benchmarking, Code generation, Computer science, CUDA, nVidia, nVidia GeForce GTX Titan X, OpenACC, Package, PTX, Tesla K40, Tesla K80, Tesla V100