Papers on hgpu.org (.txt-file)
Pangolin: An Efficient and Flexible Graph Mining System on CPU and GPU

PanJoin: A Partition-based Adaptive Stream Join

PANNA: Properties from Artificial Neural Network Architectures

Pannotia: Understanding Irregular GPGPU Graph Applications

PantaRay: fast ray-traced occlusion caching of massive scenes

PAPER – Accelerating parallel evaluations of ROCS

ParaCodex: A Profiling-Guided Autonomous Coding Agent for Reliable Parallel Code Generation and Translation

ParadisEO-MO-GPU: a Framework for Parallel GPU-based Local Search Metaheuristics

Paragon: Collaborative Speculative Loop Execution on GPU and CPU

ParaGraph: Weighted Graph Representation for Performance Optimization of HPC Kernels

Paraiso : An Automated Tuning Framework for Explicit Solvers of Partial Differential Equations

Parakeet: A Just-In-Time Parallel Accelerator for Python

Parallax: Automatic Data-Parallel Training of Deep Neural Networks

Paralleizing AwSpPCA for robust facial recognition using CUDA

Parallel 3D Fast Wavelet Transform comparison on CPUs and GPUs

Parallel 3D Finite Difference Time Domain Simulations on Graphics Processors with Cuda

Parallel 3D Image Segmentation of Large Data Sets on a GPU Cluster

Parallel 3D multigrid methods on the STI cell BE architecture

Parallel 5 point SOR for solving the Convection Diffusion equation using graphics processing units

Parallel acceleration of CPU and GPU range queries over large data sets

Parallel Acceleration on Manycore Systems and Its Performance Analysis: OpenCL Case Study

Parallel accelerators for GlimmerHMM bioinformatics algorithm

Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations

Parallel AES algorithm for fast Data Encryption on GPU

Parallel AES Encryption Engines for Many-Core Processor Arrays

Parallel Agent systems on a GPU for use with Simulations and Games

Parallel Algorithm Design and Implementation of Regular/Irregular Problems: An In-depth Performance Study on Graphics Processing Units

Parallel Algorithm for BSDEs Based High Dimensional American Option Pricing on the GPU

Parallel Algorithm for Generation of Test Recommended Path using CUDA

Parallel Algorithm for GPU Processing; for use in High Speed Machine Vision Sensing of Cotton Lint Trash

Parallel Algorithm for Solving Kepler’s Equation on Graphics Processing Units: Application to Analysis of Doppler Exoplanet Searches

Parallel Algorithm of IDCT with GPUs and CUDA for Large-scale Video Quality of 3G

Parallel algorithms for approximation of distance maps on parametric surfaces

Parallel Algorithms for Constructing Data Structures for Fast Multipole Methods

Parallel Algorithms for Counting Problems on Graphs Using Graphics Processing Units

Parallel Algorithms for GPU accelerated Probabilistic Inference

Parallel Algorithms for Hybrid Multi-core CPU-GPU Implementations of Component Labelling in Critical Phase Models

Parallel algorithms for problems of cluster analysis with very large amount of data

Parallel Algorithms for the Summed Area Table on the Asynchronous Hierarchical Memory Machine, with GPU implementations

Parallel algorithms to a parallel hardware: Designing vision algorithms for a GPU

Parallel and Concurrent Programming in Haskell: Techniques for Multicore and Multithreaded Programming

Parallel and Distributed Deep Learning

Parallel and Distributed Implementations of Multiple and Two-Dimensional Pattern Matching Algorithms

Parallel and efficient Boolean on polygonal solids

Parallel and Heterogeneous Timing Analysis: Partition, Algorithm, and System

Parallel and Improved PageRank Algorithm for GPU-CPU Collaborative Environment

Parallel and in-process compilation of individuals for genetic programming on GPU

Parallel and Scalable Sparse Basic Linear Algebra Subprograms

Parallel ant colony for nonlinear function optimization with graphics hardware acceleration

Parallel Application Library for Object Recognition

Parallel Approach for Longest Common Subsequence problem on GPU

Parallel Approach for Time Series Analysis with General Regression Neural Networks

Parallel Approaches for SWAMP Sequence Alignment

Parallel Approaches to Edit Distance and Approximate String Matching

Parallel Approaches to Shortest-Path Problems for Multilevel Heterogeneous Computing

Parallel Arbitrary-precision Integer Arithmetic

Parallel Asynchronous Modelization and Execution of Cholesky Algorithm using Petri Nets

Parallel Banding Algorithm to compute exact distance transform with the GPU

Parallel Batch Training of the Self-Organizing Map Using OpenCL

Parallel Benefit on Different Programming Paradigms

Parallel Bio-Inspired Methods for Model Optimization and Pattern Recognition

Parallel birth and death process for cell nuclei extraction in histopathology images

Parallel Branch and Bound on a CPU-GPU System

Parallel Branch Prediction on GPU Platform

Parallel Breadth First Search on GPU Clusters

Parallel BTF Compression with Multi-Level Vector Quantization in OpenCL

Parallel calculation of the median and order statistics on GPUs with application to robust regression

Parallel Catmull-Rom Spline Interpolation Algorithm for Image Zooming Based on CUDA

Parallel centerline extraction on the GPU

Parallel Chen-Han (PCH) Algorithm for Discrete Geodesics

Parallel Circuit Simulation on Graphical Processing Unit

Parallel Cloth Simulation Using OpenMP and CUDA

Parallel Compact Genetic Algorithm on CUDA-C Platform

Parallel compact roadmap construction of 3D virtual environments on the GPU

Parallel Compression Checkpointing for Socket-Level Heterogeneous Systems

Parallel Computation for Discrete Orthogonal Moments of Images Using Graphic Processing Unit

Parallel Computation of 2D Morse-Smale Complexes

Parallel computation of a SPECT projection operator for a content adaptative mesh model

Parallel Computation of Functions on Set Partitions

Parallel computation of mutual information on the GPU with application to real-time registration of 3D medical images

Parallel Computation of Non-Bonded Interactions in Drug Discovery: Nvidia GPUs vs. Intel Xeon Phi

Parallel computation of spherical parameterizations for mesh analysis

Parallel Computational Fluid Dynamics With the Intel Xeon Phi Coprocessor

Parallel Computational Intelligence-Based Multi-Camera Surveillance System

Parallel Computations for Hierarchical Agglomerative Clustering using CUDA

Parallel computations on GPU in 3D using the vortex particle method

Parallel Computer Vision: Person Data Extraction

Parallel Computing based on GPGPU using Compute Unified Device Architecture

Parallel Computing Experiences with CUDA

Parallel Computing for Accelerated Texture Classification with Local Binary Pattern Descriptors using OpenCL

Parallel Computing for the Inverse of SPD matrix

Parallel computing in a quantitative trading firm

Parallel Computing Methods For Particle Accelerator Design

Parallel Computing Model of Multiple Dimensions Data Streams Canonical Correlation Analysis with GPU

Parallel computing of 3D smoking simulation based on OpenCL heterogeneous platform

Parallel Computing of Discrete Element Method on GPU

Parallel Computing of Particle Trajectory Sonification to Enable Real-Time Interactivity

Parallel computing system for the efficient calculation of molecular similarity based on negative electrostatic potential

Parallel Computing the Longest Common Subsequence (LCS) on GPUs: Efficiency and Language Suitability

Titles: 100
open PDFs: 89
packages: 17
