hgpu.org » survey
Sparsh Mittal, Saket Gupta, Sudeb Dasgupta
Tags: DSP, FPGA, Image processing, survey
November 19, 2014 by sparsh0mittal
Recent source codes
* * *
Most viewed papers (last 30 days)
- Leveraging LLVM OpenMP GPU Offload Optimizations for Kokkos Applications
- cuSZp2: A GPU Lossy Compressor with Extreme Throughput and Optimized Compression Ratio
- pyATF: Constraint-Based Auto-Tuning in Python
- KernelBench: Can LLMs Write Efficient GPU Kernels?
- Seamless acceleration of Fortran intrinsics via AMD AI engines
- The AI CUDA Engineer: Agentic CUDA Kernel Discovery, Optimization and Composition
- A Microbenchmark Framework for Performance Evaluation of OpenMP Target Offloading
- Compiler Support for Speculation in Decoupled Access/Execute Architectures
- TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators
- Demystifying Cost-Efficiency in LLM Serving over Heterogeneous GPUs
* * *