Papers on hgpu.org (.txt-file)
OpenCL Floating Point Software on Heterogeneous Architectures – Portable or Not?
OpenCL for Database Query Processing
OpenCL for FPGAs: Prototyping a Compiler
OpenCL for programming shared memory multicore CPUs
OpenCL FPGA Optimization guided by memory accesses and roofline model analysis applied to tomography acceleration
OpenCL framework for a CPU, GPU, and FPGA Platform
OpenCL Implementation of a Color Based Object Tracking
OpenCL Implementation of a Parallel Universal Kriging Algorithm for Massive Spatial Data Interpolation on Heterogeneous Systems
OpenCL Implementation of LiDAR Data Processing
OpenCL Implementation of Montgomery Multiplication on FPGA
OpenCL Implementation of Motion Estimation for Cloud Video Processing
OpenCL in Action: How to Accelerate Graphics and Computations
OpenCL JIT Compilation for Dynamic Programming Languages
OpenCL Library for Parallel Graph Search Algorithms
OpenCL Numerical Simulations of Two-Fluid Compressible Flows With a 2D Random Choice Method
OpenCL parallel Processing using General Purpose Graphical Processing units – TiViPE software development
OpenCL Parallel Programming Development Cookbook
OpenCL Performance Evaluation on Modern Multi Core CPUs
OpenCL Performance on the Intel Heterogeneous Architecture Research Platform
OpenCL Performance Prediction using Architecture-Independent Features
OpenCL Programming Guide for Mac
OpenCL programming using Python syntax
OpenCL simulations of two-fluid compressible flows with a random choice method
OpenCL Sparse Linear Solver for Circuit Simulation
OpenCL Task Partitioning in the Presence of GPU Contention
OpenCL Vector Swizzling Optimization under Global Value Numbering
OpenCL vs: Accelerated Finite-Difference Digital Synthesis
OpenCL vs. OpenMP: A Programmability Debate
OpenCL-Accelerated Computation of a 3D SPECT Projection Operator for the Content Adaptive Mesh Model
OpenCL-accelerated object classification in video streams using Spatial Pooler of Hierarchical Temporal Memory
OpenCL-accelerated Point Feature Histogram and Its Application in Railway Track Point Cloud Data Processing
OpenCL-Accelerated Simplified General Perturbations 4 Algorithm
OpenCL-based Algorithm for Heat Load Modelling of District Heating System
OpenCL-based design methodology for application-specific processors
OpenCL-Based Design of an FPGA Accelerator for Phase-Based Correspondence Matching
OpenCL-Based Erasure Coding on Heterogeneous Architectures
OpenCL-Based FPGA Accelerator for 3D FDTD with Periodic and Absorbing Boundary Conditions
OpenCL-Based Implementation of an FPGA Accelerator for Molecular Dynamics Simulation
OpenCL-Based Mobile GPGPU Benchmarking: Methods and Challenges
OpenCL-based optimizations for acceleration of object tracking on FPGAs and GPUs
OpenCL-Darknet: implementation and optimization of OpenCL-based deep learning object detection framework
OpenCL-ready High Speed FPGA Network for Reconfigurable High Performance Computing
OpenCL-Z Android Released on Google Play
OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems
OpenCL: a viable solution for high-performance medical image reconstruction?
OpenCL: Make Ubiquitous Supercomputing Possible
OpenCL/CUDA algorithms for parallel decoding of any irregular LDPC code using GPU
OpenCL/OpenGL approach for studying active Brownian motion
OpenCLIPER: an OpenCL-based C++ Framework for Overhead-Reduced Medical Image Processing and Reconstruction on Heterogeneous Devices
OpenCUDA+MPI: A Framework for Heterogeneous GP-GPU Distributed Computing
OpenDNN: An Open-source, cuDNN-like Deep Learning Primitive Library
OpenDwarfs 2025: Modernizing the OpenDwarfs Benchmark Suite for Heterogeneous Computing
OpenDwarfs: Characterization of Dwarf-Based Benchmarks on Fixed and Reconfigurable Architectures
OpenFace: A general-purpose face recognition library with mobile applications
OpenGL application live migration with GPU acceleration in personal cloud
OpenGL SuperBible: Comprehensive Tutorial and Reference (5th Edition)
Opengl-Based Control of Semi-Active 3D Display
OpenGL(R) ES 2.0 Programming Guide
OpenGL(R) Programming Guide: The Official Guide to Learning OpenGL(R), Version 2 (5th Edition)
OpenGL(R) Shading Language (2nd Edition)
OpenGL(R) SuperBible: Comprehensive Tutorial and Reference (4th Edition)
Opening the Black Box: Performance Estimation during Code Generation for GPUs
OpenMM 8: Molecular Dynamics Simulation with Machine Learning Potentials
OpenMM: A Hardware-Independent Framework for Molecular Simulations
OpenMP as a High-Level Specification Language for Parallelism And its use in Evaluating Parallel Programming Systems
OpenMP in Multicore Architectures (tech. report)
OpenMP Kernel Language Extensions for Performance Portable GPU Codes
OpenMP offload at the Exascale using Intel GPU Max 1550: evaluation of STREAmS compressible solver
OpenMP Offloading in the Jetson Nano Platform
OpenMP on Multicore Architectures
OpenMP Parallelization and Optimization of Graph-based Machine Learning Algorithms
OpenMP performance analysis for many-core platforms with non-uniform memory access
OpenMP Programming on Intel(R) Xeon Phi(TM) Coprocessors: An Early Performance Comparison
OpenMP to GPGPU: a compiler framework for automatic translation and optimization
OpenMP, OpenMP/MPI, and CUDA/MPI C programs for solving the time-dependent dipolar Gross-Pitaevskii equation
OpenMPC: Extended OpenMP for Efficient Programming and Tuning on GPUs
OpenMPC: Extended OpenMP Programming and Tuning for GPUs
OpenNMT: Open-Source Toolkit for Neural Machine Translation
OpenOF: Framework for Sparse Non-linear Least Squares Optimization on a GPU
OpenRAND: A Performance Portable, Reproducible Random Number Generation Library for Parallel Computations
OpenRCL: Low-Power High-Performance Computing with Reconfigurable Devices
OpenSBLI: A framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures
OpenSBLI: Automated code-generation for heterogeneous computing architectures applied to compressible fluid dynamics on structured grids
OpenSSL acceleration using Graphics Processing Units
OpenVIDIA: parallel GPU computer vision
Operating Systems Challenges for GPU Resource Management
Operating systems must support GPU abstractions
OPNET: An Integrated Design Paradigm for Simulations
Opportunities for Heterogeneous CPUGPU Task Scheduling
Opportunities for Nonvolatile Memory Systems in Extreme-Scale High Performance Computing
Opportunities for Parallelism in Matrix Multiplication
Opt: A Domain Specific Language for Non-linear Least Squares Optimization in Graphics and Imaging
Optical Flow Computation on Compute Unified Device Architecture
Optical Flow via Locally Adaptive Fusion of Complementary Data Costs
Optimal Alignment of Three Sequences On A GPU
Titles: 100
open PDFs: 88
packages: 19