hgpu.org » Go
Derek L. Stinson
Tags: Computer science, CUDA, Deep learning, Go, nVidia, nVidia GeForce GTX 1080 Ti, Package, Thesis
May 17, 2020 by hgpu
Mirko Mariotti, Loriano Storchi, Daniele Spiga, Davide Salomoni, Tommaso Boccali, Daniele Bonacorsi
Tags: Computer science, FPGA, Go, Machine learning
December 8, 2019 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Dissecting the NVIDIA Blackwell Architecture with Microbenchmarks
- Performance Portable Gradient Computations Using Source Transformation
- ConTraPh: Contrastive Learning for Parallelization and Performance Optimization
- Specx: a C++ task-based runtime system for heterogeneous distributed architectures
- Geak: Introducing Triton Kernel AI Agent & Evaluation Benchmarks
- Understanding the Landscape of Ampere GPU Memory Errors
- Using Deep Reinforcement Learning for Automatic Code Optimization in the MLIR Compiler
- GBOTuner: Autotuning of OpenMP Parallel Codes with Bayesian Optimization and Code Representation Transfer Learning
- Kevin: Multi-Turn RL for Generating CUDA Kernels
- Pre-Training LLMs on a budget: A comparison of three optimizers
* * *