hgpu.org » Linear Algbera
Samuel D. Relton, Pedro Valero-Lara, Mawussi Zounon
Tags: BLAS, Computer science, CUDA, Linear Algbera, nVidia, Package, Tesla K40
August 11, 2016 by hgpu
Nicolas Weber, Michael Goesele
Tags: Computer science, CUDA, Linear Algbera, nVidia, nVidia GeForce GTX Titan X, Package, Performance, Tesla K20
April 29, 2016 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- CuTeGen: An LLM-Based Agentic Framework for Generation and Optimization of High-Performance GPU Kernels using CuTe
- Revealing NVIDIA Closed-Source Driver Command Streams for CPU-GPU Runtime Behavior Insight
- MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU
- Evaluating CUDA Tile for AI Workloads on Hopper and Blackwell GPUs
- Agentic Code Optimization via Compiler-LLM Cooperation
- FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow
- DITRON: Distributed Multi-level Tiling Compiler for Parallel Tensor Programs
- DVM: Real-Time Kernel Generation for Dynamic AI Models
- ARGUS: Agentic GPU Optimization Guided by Data-Flow Invariants
- Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization
* * *




