Papers on hgpu.org
Hardware-Assisted Software Testing and Debugging for Heterogeneous Computing
Hardware-assisted visibility sorting for unstructured volume rendering
Hardware-based nonlinear filtering and segmentation using high-level shading languages
Hardware-based simulation and collision detection for large particle systems
Hardware-Efficient Belief Propagation
Hardware-Oblivious Parallelism for In-Memory Column-Stores
Hardware-Oriented Multigrid Finite Element Solvers on GPU-Accelerated Clusters
Hardware/Software Co-Design for Data-Intensive Genomics Workloads
Hardware/Software Co-design for Energy-Efficient Seismic Modeling
Hardware/Software Vectorization for Closeness Centrality on Multi-/Many-Core Architectures
Harmonic CUDA: Asynchronous Programming on GPUs
Harnessing Aspect Oriented Programming on GPU: Application to Warp-Level Parallelism (WLP)
Harnessing GPU Computing in System-Level Software
Harnessing Integrated CPU-GPU System Memory for HPC: a first look into Grace Hopper
Harnessing the GPU for Real-Time Haptic Tissue Simulation
Harnessing the Power of GPUs without Losing Abstractions in SaC and ArrayOL: A Comparative Study
Harnessing the power of idle GPUs for acceleration of biological sequence alignment
Harvesting graphics power for MD simulations
Hash-Based Authentication Revisited in the Age of High-Performance Computers
HashGraph – Scalable Hash Tables Using A Sparse Graph Data Structure
Hashing, Caching, and Synchronization: Memory Techniques for Latency Masking Multithreaded Applications
Hauberk: Lightweight Silent Data Corruption Error Detector for GPGPU
Have GPUs made FPGAs redundant in the field of video processing?
HCudaBLAST: an implementation of BLAST on Hadoop and Cuda
HCW 2009 keynote talk: GPU computing: Heterogeneous computing for future systems
HDArray: Parallel Array Interface for Distributed Heterogeneous Devices
Head Pose Tracking Using GPU Based Real-time 3D Registration
Heat Load Modelling for District Heating Plants Using an OpenCL-based Algorithm
HEATS: Heterogeneity- and Energy-Aware Task-based Scheduling
HELIOS-K: An Ultrafast, Open-source Opacity Calculator for Radiative Transfer
Helix: Distributed Serving of Large Language Models via Max-Flow on Heterogeneous GPUs
Hera-JVM: a runtime system for heterogeneous multi-core architectures
Hercules: A Compiler for Productive Programming of Heterogeneous Systems
Hermes: an integrated CPU/GPU microarchitecture for IP routing
HeSP: a simulation framework for solving the task scheduling-partitioning problem on heterogeneous architectures
Hetero-DB: Next Generation High-Performance Database Systems by Best Utilizing Heterogeneous Computing and Storage Resources
Hetero-Mark, A Benchmark Suite for CPU-GPU Collaborative Computing
HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Reconfigurable Computing
Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads
Heterogeneity-aware Fault Tolerance using a Self-Organizing Runtime System
Heterogeneity-Aware Resource Allocation and Scheduling in the Cloud
Heterogeneous (CPU+GPU) Working-set Hash Tables
Heterogeneous Accelerated Bioinformatics-Perspectives for Cancer Research
Heterogeneous Acceleration of Volumetric JPEG 2000
Heterogeneous Active Messages (HAM) – Implementing Lightweight Remote Procedure Calls in C++
Heterogeneous Clustering with Homogeneous Code: Accelerate MPI Applications Without Code Surgery Using Intel Xeon Phi Coprocessors
Heterogeneous Computing and Grid Scheduling with Hierarchically Parallel Evolutionary Algorithms
Heterogeneous Computing and Load Balancing Techniques for Monte Carlo Simulation in a Distributed Environment
Heterogeneous Computing for Data Stream Mining
Heterogeneous Computing for Real-Time Stereo Matching
Heterogeneous Computing for Solving System of the Linear Equations by the Conjugate Gradient Method
Heterogeneous Computing for Vertebra Detection and Segmentation in X-Ray Images
Heterogeneous Computing in Economics: a Simplified Approach
Heterogeneous Computing on Mixed Unstructured Grids with PyFR
Heterogeneous computing with an algorithmic skeleton framework
Heterogeneous Computing with OpenCL
Heterogeneous CPU/(GP) GPU Memory Hierarchy Analysis and Optimization
Heterogeneous CPU/GPU co-execution of CFD simulations on the POWER9 architecture: Application to airplane aerodynamics
Heterogeneous Distributed Big Data Clustering on Sparse Grids
Heterogeneous Energy-aware Load Balancing for Industry 4.0 and IoT Environments
Heterogeneous FTDT for Seismic Processing
Heterogeneous GPU and CPU acceleration of a finite volume compressible flow solver for multiblock structured grids
Heterogeneous GPU&CPU cluster for High Performance Computing in cryptography
Heterogeneous High Throughput Scientific Computing with APM X-Gene and Intel Xeon Phi
Heterogeneous Highly Parallel Implementation of Matrix Exponentiation Using GPU
Heterogeneous multicore parallel programming for graphics processing units
Heterogeneous Network Embedding via Deep Architectures
Heterogeneous NPACI-Rocks/MPI/CUDA distributed multi-GPGPU application for seeking counterexamples to Beal’s Conjecture: MPI/CUDA integration component
Heterogeneous parallel algorithms for Computational Fluid Dynamics on unstructured meshes
Heterogeneous parallel computing for image registration and linear algebra applications
Heterogeneous Parallelization and Acceleration of Molecular Dynamics Simulations in GROMACS
Heterogeneous Programming with Single Operation Multiple Data
Heterogeneous Resource-Elastic Management for FPGAs: Concepts, Theory and Implementation
Heterogeneous Resource-Elastic Scheduling for CPU+FPGA Architectures
Heterogeneous Task Scheduling for Accelerated OpenMP
Heterogenous Acceleration for Linear Algebra in Multi-Coprocessor Environments
HeteroMap: A Runtime Performance Predictor for Efficient Processing of Graph Analytics on Heterogeneous Multi-Accelerators
HeterPS: Distributed Deep Learning With Reinforcement Learning Based Scheduling in Heterogeneous Environments
HetExchange: Encapsulating heterogeneous CPU-GPU parallelism in JIT compiled engines
HetPipe: Enabling Large DNN Training on (Whimpy) Heterogeneous GPU Clusters through Integration of Pipelined Model Parallelism and Data Parallelism
Heuristic Adaptability to Input Dynamics for SpMM on GPUs
Heuristics for Conversion Process of GPU’s Kernels for Multiples Kernels with Concurrent Optimization Divergence
Heuristics for the Variable Sized Bin Packing Problem Using a Hybrid P-System and CUDA Architecture
HexServer: an FFT-based protein docking server powered by graphics processors
HG-Caffe: Mobile and Embedded Neural Network GPU (OpenCL) Inference Engine with FP16 Supporting
HHT-based time-frequency analysis method for biomedical signal applications
HiAL-Ckpt: A hierarchical application-level checkpointing for CPU-GPU hybrid systems
HiCCL: A Hierarchical Collective Communication Library
hiCUDA: a high-level directive-based language for GPU programming
hiCUDA: High-Level GPGPU Programming
Hidden Surface Removal Using BSP Tree with CUDA
HiDP: A Hierarchical Data Parallel Language
Hierarchical belief propagation to reduce search space using CUDA for stereo and motion estimation
Hierarchical clustering of gene expression profiles with graphics hardware acceleration
Hierarchical DAG Scheduling for Hybrid Distributed Systems
Hierarchical Exploration of Volumes Using Multilevel Segmentation of the Intensity-Gradient Histograms
Hierarchical fractional-step approximations and parallel kinetic Monte Carlo algorithms
Titles: 100
open PDFs: 93
packages: 19