Joao Paulo Tarasconi Ruschel
Tags: Algorithms, Benchmarking, Computer science, CUDA, Linear Algebra, Matrix decomposition, nVidia, OpenCL, OpenMP, Package, Performance, Tesla K80, Thesis
Utku Aydonat, Shane O'Connell, Davor Capalija, Andrew C. Ling, Gordon R. Chiu
Chenhan D. Yu, William B. March, George Biros
Hasitha Muthumala Waidyasooriya, Masanori Hariyama, Kota Kasahara
Yann N. Dauphin, Angela Fan, Michael Auli, David Grangier
December 26, 2016 by hgpu
Fabio Baruffa, Luigi Iapichino, Nicolay J. Hammer, Vasileios Karakasis
December 20, 2016 by hgpu
Farhad Merchant, Tarun Vatwani, Anupam Chattopadhyay, Soumyendu Raha, S K Nandy, Ranjani Narayan
Tags: Algorithms, Computer science, CUDA, Factorization, FPGA, Linear Algebra, Mathematical Software, Matrix multiplication, nVidia, Performance, Tesla C2050
December 17, 2016 by hgpu
Richard Michael Veras, Tze Meng Low, Tyler Michael Smith, Robert van de Geijn, Franz Franchetti
December 14, 2016 by hgpu
Michael Sutton, Tal Ben-Nun, Amnon Barak, Sreepathi Pai, Keshav Pingali
December 10, 2016 by hgpu