Papers on hgpu.org (.txt-file)
Computing High Resolution Explicit Corridor Maps using Parallel Technologies

Computing least squares condition numbers on hybrid multicore/GPU systems

Computing Nash Equilibria in Bimatrix Games: GPU-based Parallel Support Enumeration

Computing of high breakdown regression estimators without sorting on graphics processing units

Computing on Knights and Kepler Architectures

Computing OpenSURF on OpenCL and General Purpose GPU

Computing optical flow using fast total variation

Computing Optimal Cycle Mean in Parallel on CUDA

Computing Performance Benchmarks among CPU, GPU, and FPGA

Computing Prestack Kirchhoff Time Migration on General Purpose GPU
Computing Privacy-Preserving Edit Distance and Smith-Waterman Problems on the GPU Architecture

Computing Reachable Sets via Barrier Methods on SIMD Architectures

Computing resultants on Graphics Processing Units: Towards GPU-accelerated computer algebra

Computing room acoustics with CUDA – 3D FDTD schemes with boundary losses and viscosity

Computing room acoustics with CUDA-3D FDTD schemes with boundary losses and viscosity

Computing Spatial Distance Histograms for Large Scientific Datasets On-the-Fly

Computing Spectral Transforms Used in Digital Logic on the GPU

Computing spike-based convolutions on GPUs

Computing Strongly Connected Components in Parallel on CUDA

Computing Strongly Connected Components with CUDA

Computing the distance between two finite element solutions defined on different 3D meshes on a GPU

Computing the Mertens function on a GPU

Computing Treewidth on the GPU

Computing trends using graphic processor in high energy physics

Computing virtual acoustics using the 3D finite difference time domain method and Kepler architecture GPUs

Computitional intensive Tasks in Multimedia Signal Processing

Compyle: a Python package for parallel computing

ConCuR: Conciseness Makes State-of-the-Art Kernel Generation

Concurrency Mapping to FPGAs with OpenCL: A Case Study with a Shallow Water Kernel

Concurrent Algorithms and Data Structures for Many-Core Processors

Concurrent Analytical Query Processing with GPUs

Concurrent CPU-GPU Task Programming using Modern C++

Concurrent kernel execution on Graphic Processing Units

Concurrent learning of a Probabilistic Graphical Model on the GPU

Concurrent Manipulation of Dynamic Data Structures in OpenCL

Concurrent Number Cruncher: An Efficient Sparse Linear Solver on the GPU

Concurrent query processing in a GPU-based database system

Concurrent Scheduling of High-Level Parallel Programs on Multi-GPU Systems

Concurrent Solutions to Linear Systems using Hybrid CPU/GPU Nodes

Concurrent Task Execution on the Intel Xeon Phi

Conditional component composition for GPU-based systems

Cone-beam Computed tomography image reconstruction based on GPU

Confidential Computing on Heterogeneous Systems: Survey and Implications

Confidentiality Issues on a GPU in a Virtualized Environment

Configuration and Benchmarks of Peer-to-Peer Communication over Gigabit Ethernet and InfiniBand in a Cluster with Intel Xeon Phi Coprocessors

Conflux: Embedding Massively Parallel Semantics in a High-Level Programming Language

Conjugate gradient solvers on Intel Xeon Phi and NVIDIA GPUs

Connected component identification and cluster update on GPU

Connected component labeling on a 2D grid using CUDA

Connected-component identification and cluster update on graphics processing units

Connecting Architecture, Fitness, Optimizations and Performance using an Anisotropic Diffusion Filter

Connectivity-Based Segmentation for GPU-Accelerated Mesh Decompression

Considerations when evaluating microprocessor platforms

Considering GPGPU for HPC Centers: Is It Worth the Effort?

Consolidating Applications for Energy Efficiency in Heterogeneous Computing Systems

Constrained inverse volume rendering for planetary nebulae

Constructing Long Short-Term Memory based Deep Recurrent Neural Networks for Large Vocabulary Speech Recognition

Constructing Natural Neighbor Interpolation Based Grid DEM Using CUDA

Constructing Two-Dimensional Voronoi Diagrams via Divide-and-Conquer of Envelopes in Space

Constructing Two-Dimensional Voronoi Diagrams via Divide-and-Conquer of Envelopes in Space (thesis)

Construction and Implementation of a Simple Agent-Based System on GPU-Architectures

Construction and Rendering of Trimmed Blending Surfaces with Sharp Features on a GPU

Construction of a Virtual Cluster by Integrating PCI Pass-Through for GPU and InfiniBand Virtualization in Cloud

Construction of Efficient Kd-Trees for Static Scenes Using Voxel-visibility Heuristic

Content Based Image Retrieval with Graphical Processing Unit

Context Parallelism for Scalable Million-Token Inference

Context-aware volume navigation
Continual surface-based multi-projector blending for moving objects

Continuous Level of Detail on Graphics Hardware

Continuous Representation of Projected Attribute Spaces of Multifields over Any Spatial Sampling

Contour-based algorithm for vectorization of satellite images

Contouring for Power Systems Using Graphical Processing Units

Contract-Based General-Purpose GPU Programming

ConTraPh: Contrastive Learning for Parallelization and Performance Optimization

Contributions of hybrid architectures to depth imaging: a CPU, APU and GPU comparative study

Contributions to Music Semantic Analysis and Its Acceleration Techniques

Contributions to Parallel Simulation of Equation-Based Models on Graphics Processing Units

Contributions to parallel stochastic simulation: Application of good software engineering practices to the distribution of pseudorandom streams in hybrid Monte-Carlo simulations

Contributions to the Efficient Use of General Purpose Coprocessors: Kernel Density Estimation as Case Study

Convergence and Scalarization for Data-Parallel Architectures

Converting Data to Task-Parallelism by Rewrites

Converting Data-Parallelism to Task-Parallelism by Rewrites: Purely Functional Programs Across Multiple GPUs

Convex Clustering: An Attractive Alternative to Hierarchical Clustering

Convolution of large 3D images on GPU and its decomposition

Convolutional Neural Network for Sentence Classification

Convolutional Neural Network-Based Image Representation for Visual Loop Closure Detection

Convolutional Neural Networks for Human Activity Recognition using Mobile Sensors

Convolutional Neural Networks for Large-Scale Bird Song Classification in Noisy Environment

COOK Access Control on an embedded Volta GPU

Cooperative CPU, GPU, and FPGA heterogeneous execution with EngineCL

Cooperative Heterogeneous Computing for Parallel Processing on CPU/GPU Hybrids

Cooperative Kernels: GPU Multitasking for Blocking Algorithms

Cooperative Multitasking for GPU-Accelerated Grid Systems

Coordinate strip-mining and kernel fusion to lower power consumption on GPU
Coordinated system level resource management for heterogeneous many-core platforms

Copperhead: Compiling an embedded data parallel language

Titles: 100
open PDFs: 97
packages: 15
