Tarun Beri, Sorav Bansal, Subodh Kumar
Tags: Computer science, CUDA, FFT, GPU cluster, Heterogeneous systems, Matrix multiplication, Memory model, nVidia, Prefetch, Task scheduling, Tesla M2070
February 11, 2014  by 
hgpuAngeles Navarro, Antonio Vilches, Francisco Corbera, Rafael Asenjo
Farouk Mansouri, Sylvain Huet, Vincent Fristot, Dominique Houzet
Xavier Lacoste, Mathieu Faverge, Pierre Ramet, Samuel Thibault, George Bosilca
Jing Zhang, Hao Wang, Heshan Lin, Wu-chun Feng
Konstantinos Krommydas, Thomas R.W. Scogland, Wu-chun Feng
Jason Power, Joel Hestness, Marc S. Orr, Mark D. Hill, David A. Wood