hgpu.org » LLM
Ali Doosthosseini, Jonathan Decker, Hendrik Nolte, Julian M. Kunkel
Tags: AI, Cloud, Computer science, HPC, LLM, nVidia, nVidia H100, Package, PC cluster
July 7, 2024 by hgpu
Avinash Maurya, Jie Ye, M. Mustafa Rafique, Franck Cappello, Bogdan Nicolae
Tags: Computer science, CUDA, LLM, Memory, nVidia, nVidia A100, nVidia H100, Performance
June 23, 2024 by hgpu
* * *
Most viewed papers (last 30 days)
- CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning
- PEAK: A Performance Engineering AI-Assistant for GPU Kernels Powered by Natural Language Transformations
- Hardware Acceleration for Neural Networks: A Comprehensive Survey
- cuPilot: A Strategy-Coordinated Multi-agent Framework for CUDA Kernel Evolution
- Tilus: A Tile-Level GPGPU Programming Language for Low-Precision Computation
- BoltzGen: Toward Universal Binder Design
- Beyond Code Pairs: Dialogue-Based Data Generation for LLM Code Translation
- The New Compiler Stack: A Survey on the Synergy of LLMs and Compilers
- AccelOpt: A Self-Improving LLM Agentic System for AI Accelerator Kernel Optimization
- SeedFold: Scaling Biomolecular Structure Prediction