hgpu.org » Embedded high-performance computing
Kulin V. Seth
Tags: Benchmarking, Computer science, DSP, Embedded high-performance computing, Heterogeneous systems, OpenCL, Optimization, Thesis
September 23, 2011 by hgpu
Jason Loew, Jesse Elwell, Dmitry Ponomarev, Patrick H. Madden
September 23, 2011 by hgpu
Shuai Mu, Chenxi Wang, Ming Liu, Dongdong Li, Maohua Zhu, Xiaoliang Chen, Xiang Xie, Yangdong Deng
May 30, 2011 by hgpu
Muhsen Owaida, Nikolaos Bellas, Konstantis Daloukas, Christos D. Antonopoulos
Tags: Code generation, Compilers, Computer science, Electronic design automation, Embedded high-performance computing, FPGA, Heterogeneous systems, OpenCL
May 21, 2011 by hgpu
T. Scogland, H. Lin, W. Feng
Tags: Computer science, Embedded high-performance computing, Energy-efficient computing, Green, Performance
November 2, 2010 by hgpu
Recent source codes
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
* * *
Most viewed papers (last 30 days)
- DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels
- Accelerating Scientific Research with Gemini: Case Studies and Common Techniques
- Deep Kernel Fusion for Transformers
- Improving HPC Code Generation Capability of LLMs via Online Reinforcement Learning with Real-Machine Benchmark Rewards
- SciDef: Automating Definition Extraction from Academic Literature with Large Language Models
- StitchCUDA: An Automated Multi-Agents End-to-End GPU Programing Framework with Rubric-based Agentic Reinforcement Learning
- Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
- Inside VOLT: Designing an Open-Source GPU Compiler (Tool)
- Execution-Centric Characterization of FP8 Matrix Cores, Asynchronous Execution, and Structured Sparsity on AMD MI300A
- HetCCL: Accelerating LLM Training with Heterogeneous GPUs
* * *



