Gargi Alavani, Santonu Sarkar
Tags: Computer science, CUDA, Energy-efficient computing, Java, Machine learning, nVidia, Package, Performance, PTX, Tesla K20, Thesis
Viktor Franzén, Carl Östling
Kazuaki Matsumura, Simon Garcia De Gonzalo, Antonio J. Peña
Tags: Benchmarking, Code generation, Computer science, CUDA, nVidia, nVidia GeForce GTX Titan X, OpenACC, Package, PTX, Tesla K40, Tesla K80, Tesla V100
Hamdy Abdelkhalik, Yehia Arafa, Nandakishore Santhi, Abdel-Hameed Badawy
Wei Sun, Ang Li, Tong Geng, Sander Stuijk, Henk Corporaal
Devashree Tripathy, AmirAli Abdolrashidi, Quan Fan, Daniel Wong, Manoranjan Satpathy
September 5, 2021 by hgpu
Sohan Lal, Aksel Alpay, Philip Salzmann, Biagio Cosenza, Alexander Hirsch, Nicolai Stawinoga, Peter Thoman, Thomas Fahringer, Vincent Heuveline
Tags: Benchmarking, Computer science, FPGA, Heterogeneous systems, nVidia, nVidia GeForce GTX Titan X, OpenCL, Package, Performance, PTX, SYCL
Somashekaracharya G. Bhaskaracharya, Julien Demouth, Vinod Grover
Tags: Compilers, Computer science, CUBLAS, CUDA, Deep learning, Matrix multiplication, nVidia, nVidia Quadro GV100, Performance, Programming Languages, PTX