Papers on hgpu.org (.txt-file)
Hash-Based Authentication Revisited in the Age of High-Performance Computers

HashGraph – Scalable Hash Tables Using A Sparse Graph Data Structure

Hashing, Caching, and Synchronization: Memory Techniques for Latency Masking Multithreaded Applications

Hauberk: Lightweight Silent Data Corruption Error Detector for GPGPU

Have GPUs made FPGAs redundant in the field of video processing?
HCudaBLAST: an implementation of BLAST on Hadoop and Cuda

HCW 2009 keynote talk: GPU computing: Heterogeneous computing for future systems

HDArray: Parallel Array Interface for Distributed Heterogeneous Devices

Head Pose Tracking Using GPU Based Real-time 3D Registration

Heat Load Modelling for District Heating Plants Using an OpenCL-based Algorithm

HEATS: Heterogeneity- and Energy-Aware Task-based Scheduling

HELIOS-K: An Ultrafast, Open-source Opacity Calculator for Radiative Transfer

Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs

Hera-JVM: a runtime system for heterogeneous multi-core architectures

Hercules: A Compiler for Productive Programming of Heterogeneous Systems

Hermes: an integrated CPU/GPU microarchitecture for IP routing

HeSP: a simulation framework for solving the task scheduling-partitioning problem on heterogeneous architectures

Hetero-DB: Next Generation High-Performance Database Systems by Best Utilizing Heterogeneous Computing and Storage Resources

Hetero-Mark, A Benchmark Suite for CPU-GPU Collaborative Computing

HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Reconfigurable Computing

Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads

Heterogeneity-aware Fault Tolerance using a Self-Organizing Runtime System

Heterogeneity-Aware Resource Allocation and Scheduling in the Cloud

Heterogeneous (CPU+GPU) Working-set Hash Tables

Heterogeneous Accelerated Bioinformatics-Perspectives for Cancer Research

Heterogeneous Acceleration of Volumetric JPEG 2000

Heterogeneous Active Messages (HAM) – Implementing Lightweight Remote Procedure Calls in C++

Heterogeneous Clustering with Homogeneous Code: Accelerate MPI Applications Without Code Surgery Using Intel Xeon Phi Coprocessors

Heterogeneous Computing and Grid Scheduling with Hierarchically Parallel Evolutionary Algorithms

Heterogeneous Computing and Load Balancing Techniques for Monte Carlo Simulation in a Distributed Environment

Heterogeneous Computing for Data Stream Mining

Heterogeneous Computing for Real-Time Stereo Matching

Heterogeneous Computing for Solving System of the Linear Equations by the Conjugate Gradient Method

Heterogeneous Computing for Vertebra Detection and Segmentation in X-Ray Images

Heterogeneous Computing in Economics: a Simplified Approach

Heterogeneous Computing on Mixed Unstructured Grids with PyFR

Heterogeneous computing with an algorithmic skeleton framework

Heterogeneous Computing with OpenCL

Heterogeneous CPU/(GP) GPU Memory Hierarchy Analysis and Optimization

Heterogeneous CPU/GPU co-execution of CFD simulations on the POWER9 architecture: Application to airplane aerodynamics

Heterogeneous Distributed Big Data Clustering on Sparse Grids

Heterogeneous Energy-aware Load Balancing for Industry 4.0 and IoT Environments

Heterogeneous FTDT for Seismic Processing

Heterogeneous GPU and CPU acceleration of a finite volume compressible flow solver for multiblock structured grids

Heterogeneous GPU&CPU cluster for High Performance Computing in cryptography

Heterogeneous High Throughput Scientific Computing with APM X-Gene and Intel Xeon Phi

Heterogeneous Highly Parallel Implementation of Matrix Exponentiation Using GPU

Heterogeneous multicore parallel programming for graphics processing units
Heterogeneous Network Embedding via Deep Architectures

Heterogeneous NPACI-Rocks/MPI/CUDA distributed multi-GPGPU application for seeking counterexamples to Beal’s Conjecture: MPI/CUDA integration component

Heterogeneous parallel algorithms for Computational Fluid Dynamics on unstructured meshes

Heterogeneous parallel computing for image registration and linear algebra applications

Heterogeneous Parallelization and Acceleration of Molecular Dynamics Simulations in GROMACS

Heterogeneous Programming with Single Operation Multiple Data

Heterogeneous Resource-Elastic Management for FPGAs: Concepts, Theory and Implementation

Heterogeneous Resource-Elastic Scheduling for CPU+FPGA Architectures

Heterogeneous Task Scheduling for Accelerated OpenMP

Heterogenous Acceleration for Linear Algebra in Multi-Coprocessor Environments

HeteroMap: A Runtime Performance Predictor for Efficient Processing of Graph Analytics on Heterogeneous Multi-Accelerators

HeterPS: Distributed Deep Learning With Reinforcement Learning Based Scheduling in Heterogeneous Environments

HetExchange: Encapsulating heterogeneous CPU-GPU parallelism in JIT compiled engines

HetPipe: Enabling Large DNN Training on (Whimpy) Heterogeneous GPU Clusters through Integration of Pipelined Model Parallelism and Data Parallelism

Heuristic Adaptability to Input Dynamics for SpMM on GPUs

Heuristics for Conversion Process of GPU’s Kernels for Multiples Kernels with Concurrent Optimization Divergence

Heuristics for the Variable Sized Bin Packing Problem Using a Hybrid P-System and CUDA Architecture

HexServer: an FFT-based protein docking server powered by graphics processors

HG-Caffe: Mobile and Embedded Neural Network GPU (OpenCL) Inference Engine with FP16 Supporting

HHT-based time-frequency analysis method for biomedical signal applications

HiAL-Ckpt: A hierarchical application-level checkpointing for CPU-GPU hybrid systems
HiCCL: A Hierarchical Collective Communication Library

hiCUDA: a high-level directive-based language for GPU programming

hiCUDA: High-Level GPGPU Programming

Hidden Surface Removal Using BSP Tree with CUDA

HiDP: A Hierarchical Data Parallel Language

Hierarchical belief propagation to reduce search space using CUDA for stereo and motion estimation

Hierarchical clustering of gene expression profiles with graphics hardware acceleration
Hierarchical DAG Scheduling for Hybrid Distributed Systems

Hierarchical Exploration of Volumes Using Multilevel Segmentation of the Intensity-Gradient Histograms

Hierarchical fractional-step approximations and parallel kinetic Monte Carlo algorithms

Hierarchical Mapping Techniques for Signal Processing Systems on Parallel Platforms

Hierarchical Markov Random Fields Applied to Model Soft Tissue Deformations on Graphics Hardware

Hierarchical Matrix Operations on GPUs: Matrix-Vector Multiplication and Compression

Hierarchical N-body simulations with auto-tuning for heterogeneous systems

Hierarchical overlapped tiling

Hierarchical Partitioning Algorithm for Scientific Computing on Highly Heterogeneous CPU + GPU Clusters

Hierarchical QR factorization algorithms for multi-core cluster systems

Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach

Hierarchical Roofline Analysis: How to Collect Data using Performance Tools on Intel CPUs and NVIDIA GPUs

Hierarchical Semantic Parsing for Object Pose Estimation in Densely Cluttered Scenes

Hierarchical Stochastic Motion Blur Rasterization

Hierarchical Transparent Programming for Heterogeneous Computing

Hierarchical Visualization and Compression of Large Volume Datasets Using GPU Clusters

High accuracy electron beam model development: MICHELLE eBEAM

High Accuracy Gravitational Waveforms from Black Hole Binary Inspirals Using OpenCL

High accuracy solutions to energy gradient flows from material science models

Titles: 100
open PDFs: 91
packages: 18
