hgpu.org » CPU cluster
David Clarke, Aleksandar Ilic, Alexey Lastovetsky, Leonel Sousa
Tags: Computer science, CPU cluster, GPU cluster, Heterogeneous systems, Matrix multiplication, nVidia, Tesla C2050, Tesla T10
June 5, 2012 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- CuTeGen: An LLM-Based Agentic Framework for Generation and Optimization of High-Performance GPU Kernels using CuTe
- MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU
- Agentic Code Optimization via Compiler-LLM Cooperation
- DVM: Real-Time Kernel Generation for Dynamic AI Models
- Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization
* * *



