hgpu.org » Embedded high-performance computing
Kulin V. Seth
Tags: Benchmarking, Computer science, DSP, Embedded high-performance computing, Heterogeneous systems, OpenCL, Optimization, Thesis
September 23, 2011 by hgpu
Jason Loew, Jesse Elwell, Dmitry Ponomarev, Patrick H. Madden
September 23, 2011 by hgpu
Shuai Mu, Chenxi Wang, Ming Liu, Dongdong Li, Maohua Zhu, Xiaoliang Chen, Xiang Xie, Yangdong Deng
May 30, 2011 by hgpu
Muhsen Owaida, Nikolaos Bellas, Konstantis Daloukas, Christos D. Antonopoulos
Tags: Code generation, Compilers, Computer science, Electronic design automation, Embedded high-performance computing, FPGA, Heterogeneous systems, OpenCL
May 21, 2011 by hgpu
T. Scogland, H. Lin, W. Feng
Tags: Computer science, Embedded high-performance computing, Energy-efficient computing, Green, Performance
November 2, 2010 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- DITRON: Distributed Multi-level Tiling Compiler for Parallel Tensor Programs
- CuBridge: An LLM-Based Framework for Understanding and Reconstructing High-Performance Attention Kernels
- KEET: Explaining Performance of GPU Kernels Using LLM Agents
- CUDAHercules: Benchmarking Hardware-Aware Expert-level CUDA Optimization for LLMs
- Kerncap: Automated Kernel Extraction and Isolation for AMD GPUs
- KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels
- Pretraining large language models with MXFP4 on Native FP4 Hardware
- Microbenchmark-Driven Analytical Performance Modeling Across Modern GPU Architectures
- CUDABeaver: Benchmarking LLM-Based Automated CUDA Debugging
- Source-to-Source Transformations for GPU Code Generation
* * *



