hgpu.org » Go
Derek L. Stinson
Tags: Computer science, CUDA, Deep learning, Go, nVidia, nVidia GeForce GTX 1080 Ti, Package, Thesis
May 17, 2020 by hgpu
Mirko Mariotti, Loriano Storchi, Daniele Spiga, Davide Salomoni, Tommaso Boccali, Daniele Bonacorsi
Tags: Computer science, FPGA, Go, Machine learning
December 8, 2019 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- CUDAHercules: Benchmarking Hardware-Aware Expert-level CUDA Optimization for LLMs
- MusaCoder: Native GPU Kernel Generation with Full-Stack Training on Moore Threads GPU
- KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels
- Pretraining large language models with MXFP4 on Native FP4 Hardware
- KForge: LLM-Driven Cross-Platform Kernel Generation for AI Accelerators
- Analyzing the Impact of Kernel Fusion on GPU Tensor Operation Performance: A Systematic Performance Study
- CUDABeaver: Benchmarking LLM-Based Automated CUDA Debugging
- Source-to-Source Transformations for GPU Code Generation
- Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation
- CodegenBench: Can LLMs Write Efficient Code Across Architectures?
* * *




