hgpu.org » Local memory
Jianbin Fang, Henk Sips, Ana Lucia Varbanescu
Tags: ATI, ATI Radeon HD 7970, Computer science, Intel Xeon Phi, Local memory, nVidia, OpenCL, Performance, Portability, Tesla C1060, Tesla C2050, Tesla K20
July 29, 2014 by jfang
Jianbin Fang, Henk Sips, Pekka Jaaskelainen, Ana Lucia Varbanescu
Tags: ATI, ATI Radeon HD 7970, Intel Xeon Phi, Local memory, nVidia, OpenCL, Reverse Engineering, Tesla C2050, Tesla K20
June 16, 2014 by jfang
Recent source codes
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
* * *
Most viewed papers (last 30 days)
- DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels
- BioAgent Bench: An AI Agent Evaluation Suite for Bioinformatics
- Accelerating Scientific Research with Gemini: Case Studies and Common Techniques
- Deep Kernel Fusion for Transformers
- SciDef: Automating Definition Extraction from Academic Literature with Large Language Models
- Towards Automated Kernel Generation in the Era of LLMs
- ProfInfer: An eBPF-based Fine-Grained LLM Inference Profiler
- Improving HPC Code Generation Capability of LLMs via Online Reinforcement Learning with Real-Machine Benchmark Rewards
- Private LLM Inference on Consumer Blackwell GPUs: A Practical Guide for Cost-Effective Local Deployment in SMEs
- PhysProver: Advancing Automatic Theorem Proving for Physics
* * *




