hgpu.org » Embedded high-performance computing
Kulin V. Seth
Tags: Benchmarking, Computer science, DSP, Embedded high-performance computing, Heterogeneous systems, OpenCL, Optimization, Thesis
September 23, 2011 by hgpu
Jason Loew, Jesse Elwell, Dmitry Ponomarev, Patrick H. Madden
September 23, 2011 by hgpu
Shuai Mu, Chenxi Wang, Ming Liu, Dongdong Li, Maohua Zhu, Xiaoliang Chen, Xiang Xie, Yangdong Deng
May 30, 2011 by hgpu
Muhsen Owaida, Nikolaos Bellas, Konstantis Daloukas, Christos D. Antonopoulos
Tags: Code generation, Compilers, Computer science, Electronic design automation, Embedded high-performance computing, FPGA, Heterogeneous systems, OpenCL
May 21, 2011 by hgpu
T. Scogland, H. Lin, W. Feng
Tags: Computer science, Embedded high-performance computing, Energy-efficient computing, Green, Performance
November 2, 2010 by hgpu



