hgpu.org » Intel Gaudi-2
Yunjae Lee, Juntaek Lim, Jehyeon Bang, Eunyeong Cho, Huijong Jeong, Taesu Kim, Hyungjun Kim, Joonhyung Lee, Jinseop Im, Ranggi Hwang, Se Jung Kwon, Dongsoo Lee, Minsoo Rhu
Tags: AI, Benchmarking, Computer science, CUDA, Intel, Intel Gaudi-2, nVidia, nVidia A100, Performance
January 6, 2025 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Data-efficient LLM Fine-tuning for Code Generation
- LithOS: An Operating System for Efficient Machine Learning on GPUs
- Dynamic Memory Management on GPUs with SYCL
- LIFT: LLM-Based Pragma Insertion for HLS via GNN Supervised Fine-Tuning
- MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications
- Efficient deep learning inference on end devices
- DeepCompile: A Compiler-Driven Approach to Optimizing Distributed Deep Learning Training
- InteropUnityCUDA: A Tool for Interoperability Between Unity and CUDA
- Mìmir: A real-time interactive visualization library for CUDA programs
- Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration
* * *