Mario Mendez-Lojo, Martin Burtscher, Keshav Pingali
Yifeng Chen, Xiang Cui, Hong Mei
Tags: Code generation, Computer science, CUDA, FFT, GPU cluster, Heterogeneous systems, MPI, nVidia, Optimization, Performance, Programming techniques, Pthreads, Tesla C1060
Pablo Quesada-Barriuso, Julian Lamas-Rodriguez, Dora B. Heras, Montserrat Boo, Francisco Arguello
Prasanna Sattigeri, Jayaraman J. Thiagarajan, Karthikeyan N. Ramamurthy, Andreas Spanias
Jing-yu Cui, Guillem Pratx, Sven Prevrhal, Craig S. Levin
Vincent de Ladurantaye, Jean Lavoie, Jocelyn Bergeron, Maxime Parenteau, Huizhong Lu, Ramin Pichevar, Jean Rouat
Pritam Prakash Shete, Venkat P. P. K., S. K. Bose
David Gonzalez, Christian Sanchez, Ricardo Veguilla, Nayda G. Santiago, Samuel Rosario-Torres, Miguel Velez-Reyes
Jonatan Ward, Sergey Andreev, Francisco Heredia, Bogdan Lazar, Zlatka Manevska