high performance computing on graphics processing units: hgpu.org

Embedded high-performance computing

Integrated Framework for Heterogeneous Embedded Platforms Using OpenCL

Kulin V. Seth

Tags: Benchmarking, Computer science, DSP, Embedded high-performance computing, Heterogeneous systems, OpenCL, Optimization, Thesis

September 23, 2011 by hgpu

Mathematical limits of parallel computation for embedded systems

Jason Loew, Jesse Elwell, Dmitry Ponomarev, Patrick H. Madden

Tags: Computer science, Embedded high-performance computing, Performance

September 23, 2011 by hgpu

Evaluating the potential of graphics processors for high performance embedded computing

Shuai Mu, Chenxi Wang, Ming Liu, Dongdong Li, Maohua Zhu, Xiaoliang Chen, Xiang Xie, Yangdong Deng

Tags: ASIC, Benchmarking, Computer science, DSP, Embedded high-performance computing

May 30, 2011 by hgpu

Synthesis of Platform Architectures from OpenCL Programs

Muhsen Owaida, Nikolaos Bellas, Konstantis Daloukas, Christos D. Antonopoulos

Tags: Code generation, Compilers, Computer science, Electronic design automation, Embedded high-performance computing, FPGA, Heterogeneous systems, OpenCL

May 21, 2011 by hgpu

A first look at integrated GPUs for green high-performance computing

T. Scogland, H. Lin, W. Feng

Tags: Computer science, Embedded high-performance computing, Energy-efficient computing, Green, Performance

November 2, 2010 by hgpu

Recent source codes

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

CASS: Cuda-Amd aSSembly

Cluster of smartphones for edge computing application using TensorFlow

SYCL Container
