hgpu.org » Go
Derek L. Stinson
Tags: Computer science, CUDA, Deep learning, Go, nVidia, nVidia GeForce GTX 1080 Ti, Package, Thesis
May 17, 2020 by hgpu
Mirko Mariotti, Loriano Storchi, Daniele Spiga, Davide Salomoni, Tommaso Boccali, Daniele Bonacorsi
Tags: Computer science, FPGA, Go, Machine learning
December 8, 2019 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Compiler and Runtime Systems for Generative AI Models
- Scalable GPU-Based Integrity Verification for Large Machine Learning Models
- STARK: Strategic Team of Agents for Refining Kernels
- Tutoring LLM into a Better CUDA Optimizer
- INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats
- Neptune: Advanced ML Operator Fusion for Locality and Parallelism on GPUs
- Adaptivity in AdaptiveCpp: Optimizing Performance by Leveraging Runtime Information During JIT-Compilation
- Collective Communication for 100k+ GPUs
- Enhancing Transformer Performance and Portability through Auto-tuning Frameworks
- CudaForge: An Agent Framework with Hardware Feedback for CUDA Kernel Optimization
* * *




