Shun-ichiro Hayashi, Koki Morita, Daichi Mukunoki, Tetsuya Hoshino, Takahiro Katagiri
Tags: AI, Code generation, Computer science, CUDA, HPC, LLM, nVidia, OpenACC, OpenMP, Package, Tesla V100
William F. Godoy, Tatiana Melnichenko, Pedro Valero-Lara, Wael Elwasif, Philip Fackler, Rafael Ferreira Da Silva, Keita Teranishi, Jeffrey S. Vetter
Tags: AI, AMD Radeon Instinct MI300A, ATI, Compilers, Computer science, CUDA, HIP, HPC, nVidia, nVidia H100, Package, Python, ROCm
September 28, 2025 by
hgpuShihan Fang, Hongzheng Chen, Niansong Zhang, Jiajie Li, Han Meng, Adrian Liu, Zhiru Zhang
September 21, 2025 by
hgpuRobert Tjarko Lange, Qi Sun, Aaditya Prasad, Maxence Faldor, Yujin Tang, David Ha
September 21, 2025 by
hgpuAndreas Herten, Olga Pearce, Filipe S. M. Guimarães
Tags: Benchmarking, Computer science, CUDA, Fortran, HIP, HPC, MPI, OpenACC, OpenCL, OpenMP, Package, Performance, ROCm, SYCL
September 14, 2025 by
hgpuNripesh Niketan, Vaatsalya Shrivastva
Tags: Compilers, Computer science, CUDA, DirectX, GLSL, HIP, HLSL, nVidia, OpenGL, Package, Programming Languages, Vulkan
September 7, 2025 by
hgpuDavid Jin, Alexis Montoison, Sungho Shin
Tags: AMD Radeon Instinct MI300X, ATI, Benchmarking, BLAS, Computer science, CUDA, Factorization, Julia, nVidia, nVidia H200, Package, ROCm
September 7, 2025 by
hgpuXiyan Hu, Titus Parker, Connor Phillips, Yifa Yu
Tags: AMD Radeon Instinct MI210, AMD Radeon Instinct MI325X, ATI, Computer science, HIP, nVidia, nVidia A100, Package, Performance, PyTorch, ROCm, Thesis
Jacob Wahlgren, Gabin Schieffer, Ruimin Shi, Edgar A. León, Roger Pearce, Maya Gokhale, Ivy Peng