hgpu.org » Approximate computing and storage
Stefano Cherubin, Giovanni Agosta
Tags: Approximate computing and storage, Computer science, CUDA, HPC, Mixed precision, nVidia, OpenCL, Precision, survey
May 3, 2020 by hgpu
Sparsh Mittal
January 14, 2016 by sparsh0mittal
* * *
Most viewed papers (last 30 days)
- Dissecting the NVIDIA Blackwell Architecture with Microbenchmarks
- Performance Portable Gradient Computations Using Source Transformation
- ConTraPh: Contrastive Learning for Parallelization and Performance Optimization
- Specx: a C++ task-based runtime system for heterogeneous distributed architectures
- Geak: Introducing Triton Kernel AI Agent & Evaluation Benchmarks
- Understanding the Landscape of Ampere GPU Memory Errors
- Using Deep Reinforcement Learning for Automatic Code Optimization in the MLIR Compiler
- GBOTuner: Autotuning of OpenMP Parallel Codes with Bayesian Optimization and Code Representation Transfer Learning
- Kevin: Multi-Turn RL for Generating CUDA Kernels
- Pre-Training LLMs on a budget: A comparison of three optimizers
* * *