hgpu.org » nVidia GeForce GTX 620
Husheng Zhou
Tags: Computer science, CUDA, Deep learning, Heterogeneous systems, Neural networks, nVidia, nVidia GeForce GTX 480, nVidia GeForce GTX 620, nVidia GeForce GTX 660, nVidia Jetson TX2, nVidia Quadro 6000, Tesla K80, Thesis
May 12, 2019 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Revealing NVIDIA Closed-Source Driver Command Streams for CPU-GPU Runtime Behavior Insight
- CuTeGen: An LLM-Based Agentic Framework for Generation and Optimization of High-Performance GPU Kernels using CuTe
- MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU
- Evaluating CUDA Tile for AI Workloads on Hopper and Blackwell GPUs
- Agentic Code Optimization via Compiler-LLM Cooperation
- FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow
- DITRON: Distributed Multi-level Tiling Compiler for Parallel Tensor Programs
- DVM: Real-Time Kernel Generation for Dynamic AI Models
- ARGUS: Agentic GPU Optimization Guided by Data-Flow Invariants
- Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization
* * *



