hgpu.org » Compielrs
Hongbin Zhang, Mingjie Xing, Yanjun Wu, Chen Zhao
Tags: Compielrs, Computer science, Deep learning, HLS, OpenCL, Package, survey
June 4, 2023 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- CudaForge: An Agent Framework with Hardware Feedback for CUDA Kernel Optimization
- Scalable GPU-Based Integrity Verification for Large Machine Learning Models
- STARK: Strategic Team of Agents for Refining Kernels
- Tutoring LLM into a Better CUDA Optimizer
- INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats
- Collective Communication for 100k+ GPUs
- An MLIR pipeline for offloading Fortran to FPGAs via OpenMP
- Enhancing Transformer Performance and Portability through Auto-tuning Frameworks
- RDMA Point-to-Point Communication for LLM Systems
- A Study of Floating-Point Precision Tuning in Deep Learning Operators Implementations
* * *




