Papers on hgpu.org (.txt-file)
A Note on Auto-tuning GEMM for GPUs

A Note on Particle Filters Applied to DSGE Models

A note on the GPU acceleration of eigenvalue computations

A novel and scalable Multigrid algorithm for many-core architectures

A novel approach for implementing Steganography with computing power obtained by combining Cuda and Matlab

A novel approach to evaluating compact finite differences and similar tridiagonal schemes on GPU-accelerated clusters

A Novel Approach to Visualizing Dark Matter Simulations

A Novel Compiler Transformation for Fast Sparse Matrix Multiplication in GPUs

A Novel Computational Model for GPUs with Applications to Efficient Algorithms

A Novel Computing-Enhanced Cloud Storage Model Supporting Combined Service Aware
A Novel CPU/GPU Simulation Environment for Large-Scale Biologically-Realistic Neural Modeling

A Novel CSR-Based Sparse Matrix-Vector Multiplication on GPUs

A Novel Data Structure for Particle System Simulation based on GPU with the Use of Neighborhood Grids

A novel FPGA-based SVM classifier

A Novel GPU Implementation of Eigen Analysis for Risk Management

A Novel GPU-Based Deformation Pipeline

A Novel GPU-based Parallel Implementation Scheme and Performance Analysis of Robot Forward Dynamics Algorithms

A Novel Graphical Processing Unit Method for Power Systems Security Analysis

A novel hardware acceleration technique for high performance parallel FDTD method
A Novel Implementation of QuickHull Algorithm on the GPU

A Novel Interface for Interactive Exploration of DTI Fibers

A Novel Learning Algorithm for Bayesian Network and Its Efficient Implementation on GPU

A Novel Mapping of Arbitrary Precision Integer Operations to the GPU

A Novel Memory-Efficient Deep Learning Training Framework via Error-Bounded Lossy Compression

A Novel Monte Carlo Noise Reduction Operator

A Novel Multi-GPU Neural Simulator

A Novel Open Source Morphology Using GPU Processing With LTU-CUDA

A novel parallel Tier-1 coder for JPEG2000 using GPUs
A Novel Scheme for High Performance Finite-Difference Time-Domain (FDTD) Computations Based on GPU

A novel sorting algorithm for many-core architectures based on adaptive bitonic sort

A novel stereo camera based collision warning system for automotive applications
A NPR System for Generating Floral Patterns based on L-System

A Numerical Study of Continuous Data Assimilation for the 2D-NS Equations Using Nodal Points

A numerical tour of wave propagation

A Package for Multi-Dimensional Monte Carlo Integration on Multi-GPUs

A Package for OpenCL Based Heterogeneous Computing on Clusters with Many GPU Devices

A parallel accelerator for semantic search

A Parallel Access Method for Spatial Data Using GPU

A Parallel Active-Set Method for Solving Frictional Contact Problems

A Parallel Algorithm Development Model for the GPU Architecture

A Parallel Algorithm for Calculation of Large Determinants with High Accuracy for GPUs and MPI clusters

A Parallel Algorithm for Dot Product over Word-Size Finite Field Using Floating-Point Arithmetic

A Parallel Algorithm for Enumerating Joint Weight of a Binary Linear Code in Network Coding

A Parallel Algorithm for Error Correction in High-Throughput Short-Read Data on CUDA-Enabled Graphics Hardware

A Parallel Algorithm for Flight Route Planning on GPU Using CUDA (thesis)

A parallel algorithm for implicit depletant simulations

A Parallel Algorithm for LZW Decompression, with GPU Implementation

A parallel algorithm for the constrained shortest path problem on lattice graphs

A Parallel Algorithm for UAV Flight Route Planning on GPU
A Parallel Algorithm of PCA-SIFT Based on CUDA

A Parallel Algorithm to Test Chordality of Graphs

A Parallel Ant Colony Optimization Algorithm for the Travelling Salesman Problem: Improving Performance Using CUDA

A Parallel Auxiliary Grid AMG Method for GPU

A Parallel Cellular Automaton Simulation Framework using CUDA

A Parallel Compression Pipeline for Improving GPU Virtualization Data Transfers

A parallel decoding algorithm of LDPC codes using CUDA

A Parallel Deconvolution Algorithm in Perfusion Imaging

A Parallel Depth-aided Exemplar-based Inpainting for Real-time View Synthesis on GPU

A Parallel Edge Preserving Algorithm for Salt and Pepper Image Denoising

A parallel error diffusion implementation on a GPU

A parallel evolutionary algorithm to optimize dynamic memory managers in embedded systems

A Parallel Framework for Parametric Maximum Flow Problems in Image Segmentation

A parallel Genetic Programming algorithm for classification

A Parallel Gibbs Sampling Algorithm for Motif Finding on GPU
A Parallel GPU Version of the Traveling Salesman Problem

A Parallel Image Segmentation Algorithm on GPUs

A Parallel Immune Algorithm Based on Fine-Grained Model with GPU-Acceleration
A parallel implementation of a derivative pricing model incorporating SABR calibration and probability lookup tables

A Parallel Implementation of the Galerkin Method for Solving Partial Differential Equations on a Triangular Mesh

A Parallel Implementation of the Self Organising Map using OpenCL

A Parallel Intermediate Representation for Embedded Languages

A Parallel Jacobi-Type Lattice Basis Reduction Algorithm

A parallel mapping of optical flow to Compute Unified Device Architecture for motion-based image segmentation

A Parallel Mediated Reality Platform

A Parallel Method for Impulsive Image Noise Removal on Hybrid CPU/GPU Systems

A parallel method for tuning Fuzzy TSK Systems with CUDA

A Parallel Monte Carlo Code for Simulating Collisional N-body Systems

A Parallel Multi-view Rendering Architecture

A parallel pattern for iterative stencil + reduce

A Parallel Preconditioned Bi-Conjugate Gradient Stabilized Solver for the Poisson Problem

A Parallel Preconditioned Conjugate Gradient Solver for the Poisson Problem on a Multi-GPU Platform

A Parallel PSO Algorithm for a Watermarking Application on a GPU

A Parallel Ray Tracing Architecture Suitable for Application-Specific Hardware and GPGPU Implementations

A Parallel Recursive Approach for Solving All Pairs Shortest Path Problem on GPU using OpenCL

A parallel search tree algorithm for vertex cover on graphical processing units

A Parallel Solution to Finding Nodal Neighbors in Generic Meshes

A Parallel Solver for Markov Decision Process in Crowd Simulations

A Parallel Sparse Tensor Benchmark Suite on CPUs and GPUs

A Parallel Streaming Motion Estimation for Real-Time HD H.264 Encoding on Programmable Processors

A Parallel Supercomputer Implementation of a Biological Inspired Neural Network and its use for Pattern Recognition

A Parallel Tree Pattern Query Processing Algorithm for Graph Databases using a GPGPU

A Parallel Twig Join Algorithm for XML Processing using a GPGPU

A parallelization cost model for GPU
A Parallelized Algorithm for Hyperspectral Biometrics

A Parallelized Implementation for H. 264 Real-time Encoding Scheme

A Parallelizing Matlab Compiler Framework and Run time for Heterogeneous Systems

A parameterisable and scalable Smith-Waterman algorithm implementation on CUDA-compatible GPUs

Titles: 100
open PDFs: 88
packages: 13
