Jinyuan Yang, Soumyabrata Dev, Abraham G. Campbell
September 22, 2024 by hgpu
Junqing Lin, Jingwei Sun, Xiaolong Shi, Honghe Zhang, Xianzhi Yu, Xinzhi Wang, Jun Yao, Guangzhong Sun
Tags: Compilers, Computer science, CUDA, Deep learning, Linear Algebra, Matrix multiplication, Neural networks, nVidia, nVidia GeForce RTX 2080 Ti, Performance, Sparse matrix, Tesla V100
Yizhou Luo, Qiang Wang, Shaohuai Shi, Jiaxin Lai, Shuhan Qi, Jiajia Zhang, Xuan Wang
Floris-Jan Willemsen, Richard Schoonhoven, Jiří Filipovič, Jacob O. Tørring, Rob van Nieuwpoort, Ben van Werkhoven
Hojin Choi, SeongJun Choi, SeogChung Seo
Jiacheng Yang, Christina Giannoula, Jun Wu, Mostafa Elhoushi, James Gleeson, Gennady Pekhimenko
Tags: Cloud, Computer science, CUDA, Matrix multiplication, nVidia, nVidia GeForce RTX 2070, nVidia GeForce RTX 2080 Ti, nVidia GeForce RTX 3090, Package, Performance, PyTorch, Tesla A100
Ashwina Kumar, M. Venkata Krishna, Prasanna Bartakke, Rahul Kumar, Rajesh Pandian M, Nibedita Behera, Rupesh Nasre
Tags: Code generation, Computer science, CUDA, DSL, nVidia, nVidia GeForce RTX 2080 Ti, OpenACC, OpenCL, Package, SYCL, Tesla V100
Jan Solanti, Michal Babej, Julius Ikkala, Pekka Jääskeläinen
September 6, 2023 by hgpu
Mahmood Naderan-Tahan, Hossein SeyyedAghaei, Lieven Eeckhout
Bala Gurumurthy, David Broneske, Martin Schäler, Thilo Pionteck
Tags: Computer science, Databases, Hashing, Heterogeneous systems, nVidia, nVidia A100, nVidia GeForce GTX 1050 Ti, nVidia GeForce RTX 2080 Ti, nVidia V100, OpenCL, Package, Sorting
Ryan R. Curtin, Marcus Edel, Conrad Sanderson