Papers on hgpu.org (.txt-file)
Hardware-assisted Rendering of CSG Models

Hardware-Assisted Software Testing and Debugging for Heterogeneous Computing

Hardware-assisted visibility sorting for unstructured volume rendering

Hardware-based nonlinear filtering and segmentation using high-level shading languages

Hardware-based simulation and collision detection for large particle systems

Hardware-Efficient Belief Propagation

Hardware-Oblivious Parallelism for In-Memory Column-Stores

Hardware-Oriented Multigrid Finite Element Solvers on GPU-Accelerated Clusters

Hardware/Software Co-Design for Data-Intensive Genomics Workloads

Hardware/Software Co-design for Energy-Efficient Seismic Modeling

Hardware/Software Vectorization for Closeness Centrality on Multi-/Many-Core Architectures

Harmonic CUDA: Asynchronous Programming on GPUs

Harnessing Aspect Oriented Programming on GPU: Application to Warp-Level Parallelism (WLP)

Harnessing Batched BLAS/LAPACK Kernels on GPUs for Parallel Solutions of Block Tridiagonal Systems

Harnessing GPU Computing in System-Level Software

Harnessing Integrated CPU-GPU System Memory for HPC: a first look into Grace Hopper

Harnessing the GPU for Real-Time Haptic Tissue Simulation

Harnessing the Power of GPUs without Losing Abstractions in SaC and ArrayOL: A Comparative Study

Harnessing the power of idle GPUs for acceleration of biological sequence alignment

Harvesting graphics power for MD simulations

Hash-Based Authentication Revisited in the Age of High-Performance Computers

HashGraph – Scalable Hash Tables Using A Sparse Graph Data Structure

Hashing, Caching, and Synchronization: Memory Techniques for Latency Masking Multithreaded Applications

Hauberk: Lightweight Silent Data Corruption Error Detector for GPGPU

Have GPUs made FPGAs redundant in the field of video processing?
HCudaBLAST: an implementation of BLAST on Hadoop and Cuda

HCW 2009 keynote talk: GPU computing: Heterogeneous computing for future systems

HDArray: Parallel Array Interface for Distributed Heterogeneous Devices

Head Pose Tracking Using GPU Based Real-time 3D Registration

Heat Load Modelling for District Heating Plants Using an OpenCL-based Algorithm

HEATS: Heterogeneity- and Energy-Aware Task-based Scheduling

HELIOS-K: An Ultrafast, Open-source Opacity Calculator for Radiative Transfer

Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs

Hera-JVM: a runtime system for heterogeneous multi-core architectures

Hercules: A Compiler for Productive Programming of Heterogeneous Systems

Hermes: an integrated CPU/GPU microarchitecture for IP routing

HeSP: a simulation framework for solving the task scheduling-partitioning problem on heterogeneous architectures

HetCCL: Accelerating LLM Training with Heterogeneous GPUs

Hetero-DB: Next Generation High-Performance Database Systems by Best Utilizing Heterogeneous Computing and Storage Resources

Hetero-Mark, A Benchmark Suite for CPU-GPU Collaborative Computing

HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Reconfigurable Computing

Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads

Heterogeneity-aware Fault Tolerance using a Self-Organizing Runtime System

Heterogeneity-Aware Resource Allocation and Scheduling in the Cloud

Heterogeneous (CPU+GPU) Working-set Hash Tables

Heterogeneous Accelerated Bioinformatics-Perspectives for Cancer Research

Heterogeneous Acceleration of Volumetric JPEG 2000

Heterogeneous Active Messages (HAM) – Implementing Lightweight Remote Procedure Calls in C++

Heterogeneous Clustering with Homogeneous Code: Accelerate MPI Applications Without Code Surgery Using Intel Xeon Phi Coprocessors

Heterogeneous Computing and Grid Scheduling with Hierarchically Parallel Evolutionary Algorithms

Heterogeneous Computing and Load Balancing Techniques for Monte Carlo Simulation in a Distributed Environment

Heterogeneous Computing for Data Stream Mining

Heterogeneous Computing for Real-Time Stereo Matching

Heterogeneous Computing for Solving System of the Linear Equations by the Conjugate Gradient Method

Heterogeneous Computing for Vertebra Detection and Segmentation in X-Ray Images

Heterogeneous Computing in Economics: a Simplified Approach

Heterogeneous Computing on Mixed Unstructured Grids with PyFR

Heterogeneous computing with an algorithmic skeleton framework

Heterogeneous Computing with OpenCL

Heterogeneous CPU/(GP) GPU Memory Hierarchy Analysis and Optimization

Heterogeneous CPU/GPU co-execution of CFD simulations on the POWER9 architecture: Application to airplane aerodynamics

Heterogeneous Distributed Big Data Clustering on Sparse Grids

Heterogeneous Energy-aware Load Balancing for Industry 4.0 and IoT Environments

Heterogeneous FTDT for Seismic Processing

Heterogeneous GPU and CPU acceleration of a finite volume compressible flow solver for multiblock structured grids

Heterogeneous GPU&CPU cluster for High Performance Computing in cryptography

Heterogeneous High Throughput Scientific Computing with APM X-Gene and Intel Xeon Phi

Heterogeneous Highly Parallel Implementation of Matrix Exponentiation Using GPU

Heterogeneous multicore parallel programming for graphics processing units
Heterogeneous Network Embedding via Deep Architectures

Heterogeneous NPACI-Rocks/MPI/CUDA distributed multi-GPGPU application for seeking counterexamples to Beal’s Conjecture: MPI/CUDA integration component

Heterogeneous parallel algorithms for Computational Fluid Dynamics on unstructured meshes

Heterogeneous parallel computing for image registration and linear algebra applications

Heterogeneous Parallelization and Acceleration of Molecular Dynamics Simulations in GROMACS

Heterogeneous Programming with Single Operation Multiple Data

Heterogeneous Resource-Elastic Management for FPGAs: Concepts, Theory and Implementation

Heterogeneous Resource-Elastic Scheduling for CPU+FPGA Architectures

Heterogeneous Task Scheduling for Accelerated OpenMP

Heterogenous Acceleration for Linear Algebra in Multi-Coprocessor Environments

HeteroMap: A Runtime Performance Predictor for Efficient Processing of Graph Analytics on Heterogeneous Multi-Accelerators

HeterPS: Distributed Deep Learning With Reinforcement Learning Based Scheduling in Heterogeneous Environments

HetExchange: Encapsulating heterogeneous CPU-GPU parallelism in JIT compiled engines

HetPipe: Enabling Large DNN Training on (Whimpy) Heterogeneous GPU Clusters through Integration of Pipelined Model Parallelism and Data Parallelism

Heuristic Adaptability to Input Dynamics for SpMM on GPUs

Heuristics for Conversion Process of GPU’s Kernels for Multiples Kernels with Concurrent Optimization Divergence

Heuristics for the Variable Sized Bin Packing Problem Using a Hybrid P-System and CUDA Architecture

HexServer: an FFT-based protein docking server powered by graphics processors

HG-Caffe: Mobile and Embedded Neural Network GPU (OpenCL) Inference Engine with FP16 Supporting

HHT-based time-frequency analysis method for biomedical signal applications

HiAL-Ckpt: A hierarchical application-level checkpointing for CPU-GPU hybrid systems
HiCCL: A Hierarchical Collective Communication Library

hiCUDA: a high-level directive-based language for GPU programming

hiCUDA: High-Level GPGPU Programming

Hidden Surface Removal Using BSP Tree with CUDA

HiDP: A Hierarchical Data Parallel Language

Hierarchical belief propagation to reduce search space using CUDA for stereo and motion estimation

Hierarchical clustering of gene expression profiles with graphics hardware acceleration
Titles: 100
open PDFs: 93
packages: 20
