hgpu.org » Intel Gaudi-2
Yunjae Lee, Juntaek Lim, Jehyeon Bang, Eunyeong Cho, Huijong Jeong, Taesu Kim, Hyungjun Kim, Joonhyung Lee, Jinseop Im, Ranggi Hwang, Se Jung Kwon, Dongsoo Lee, Minsoo Rhu
Tags: AI, Benchmarking, Computer science, CUDA, Intel, Intel Gaudi-2, nVidia, nVidia A100, Performance
January 6, 2025 by hgpu
* * *
Most viewed papers (last 30 days)
- Asynchronous-Many-Task Systems: Challenges and Opportunities - Scaling an AMR Astrophysics Code on Exascale machines using Kokkos and HPX
- Scalable Access-Pattern Aware I/O Acceleration and Multi-Tiered Data Management for HPC and AI Workloads
- Towards Performance-Aware Allocation for Accelerated Machine Learning on GPU-SSD Systems
- HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages
- A comparison of HPC-based quantum computing simulators using Quantum Volume
- A survey on FPGA-based accelerator for ML models
- TorchQC - A framework for efficiently integrating machine and deep learning methods in quantum dynamics and control
- Analyzing the Performance Portability of SYCL across CPUs, GPUs, and Hybrid Systems with Protein Database Search
- Reproducible Study and Performance Analysis of GPU Programming Paradigms: OpenACC vs. CUDA in Key Linear Algebra Computations
- Utilizing Tensor Cores in Futhark
* * *