hgpu.org » speedup
Ricardo Nobre, Tiago Carneiro, Marcos Negreiros, Felipe Martins Muller
October 8, 2014 by rhnobre
Recent source codes
* * *
Most viewed papers (last 30 days)
- CuTeGen: An LLM-Based Agentic Framework for Generation and Optimization of High-Performance GPU Kernels using CuTe
- MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU
- Agentic Code Optimization via Compiler-LLM Cooperation
- DVM: Real-Time Kernel Generation for Dynamic AI Models
- Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization
* * *



