Papers on hgpu.org (.txt-file)
Image spatial diffusion on GPUs

Image super-resolution by vectorizing edges

Image Super-Resolution Using Deep Convolutional Networks

Image-based fast three-dimensional leaf modeling
Image-Based Material Restyling with Fast Non-local Means Filtering
Image-Based Proxy Accumulation for Real-Time Soft Global Illumination

Image-Space Caustics and Curvatures

Image-Space Collision Detection Through Alternate Surface Peeling

Image-Space GPU Metaballs for Time-Dependent Particle Data Sets

ImageCL: An Image Processing Language for Performance Portability on Heterogeneous Systems

ImageCL: Language and source-to-source compiler for performance portability, load balancing, and scalability prediction on heterogeneous systems

Impact of asynchronism on GPU accelerated parallel iterative computations

Impact of communication times on mixed CPU/GPU applications scheduling using KAAPI

Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation

Impact of Floating-Point Precision on Boundary Layer Instabilities Modeled on Fermi GPU

Impact of GPU Memory Access Patterns on FDTD

Impact of Modern OpenGL on FPS

Impact of the channel count on the nonlinear tolerance in coherently-detected POLMUX-QPSK modulation
Impact of Warp Formation on GPU Performance

Impacts of Parallel Programming on Limited-Resource Hardware

Implementability of shading models for current game engines
Implementation & Parallelisation of FDTD code for Electromagnetic Scattering

Implementation and Analysis of AES Encryption on GPU

Implementation and Evaluation of Recurrence Equation Solvers on GPGPU systems using Rearrangement of Array Configurations

Implementation and Evaluation of Scientific Simulations on High Performance Computing Architectures

Implementation and evaluation of various demons deformable image registration algorithms on GPU

Implementation and Experimental Evaluation of a CUDA Core under Single Event Effects

Implementation and Optimization of Image Processing Algorithms on Embedded GPU

Implementation and optimization of image processing algorithms on handheld GPU
Implementation and Performance Analysis of Many-body Quantum Chemical Methods on the Intel Xeon Phi Coprocessor and NVIDIA GPU Accelerator

Implementation and Performance Analysis of SEAL Encryption on FPGA, GPU and Multi-core Processors
Implementation and performance analysis of the AXPY, DOT, and SpMV functions on Intel Xeon Phi and NVIDIA Tesla using OpenCL

Implementation and performance evaluation of a GPU particle-in-cell code

Implementation and performance evaluation of reconstruction algorithms on graphics processors
Implementation Details of GPU-based Out-of-Core Many-Lights Rendering

Implementation of 2-D Discrete Cosine Transform Algorithm on GPU

Implementation of 3D FFTs Across Multiple GPUs in Shared Memory Environments

Implementation of 3D Monte Carlo PET reconstruction algorithm on GPU

Implementation of 802.11n on 128-CORE Processor

Implementation of a 3GPP LTE turbo decoder accelerator on GPU
Implementation of a distributed real-time video panorama pipeline for creating high quality virtual views

Implementation of a Fast Image Coding and Retrieval System Using a GPU

Implementation of a High Throughput 3GPP Turbo Decoder on GPU

Implementation of a High Throughput Soft MIMO Detector on GPU

Implementation of a Lattice Boltzmann kernel using the Compute Unified Device Architecture developed by nVIDIA

Implementation of a Lattice–Boltzmann method for numerical fluid mechanics using the nVIDIA CUDA technology

Implementation of a motion estimation algorithm for Intel FPGAs using OpenCL

Implementation of a Multi-User Detector for Satellite Return Links on a GPU Platform

Implementation of a multigrid solver on GPU for Stokes equations with strongly variable viscosity based on Matlab and CUDA

Implementation of a Parallel Tree Method on a GPU

Implementation of a PIC simulation using WebGL

Implementation of a Power Efficient Synthetic Aperture Radar Back Projection Algorithm on FPGAs Using OpenCL

Implementation of a Practical Distributed Calculation System with Browsers and JavaScript, and Application to Distributed Deep Learning

Implementation of a programming environment with a multithread model for reconfigurable systems
Implementation of a Soft Morphological Filter Based on GPU Framework
Implementation of Advanced Encryption Standard for encryption and decryption of images and text on a GPU

Implementation of algorithms for relativistic hydrodynamics using graphics processing units in CUDA framework

Implementation of algorithms with a fine-grained parallelism on GPUs
Implementation of Ant Colony Algorithm Based on GPU
Implementation of association rule mining using CUDA
Implementation of Autoencoders with Systolic Arrays through OpenCL

Implementation Of Decoders for LDPC Block Codes and LDPC Convolutional Codes Based on GPUs

Implementation of Diamond Search Algorithm Using Parallel Processing Architecture

Implementation of digital down converter in GPU

Implementation of Fast Artificial Neural Network for Pattern Classification on Heterogeneous System

Implementation of FDTD-Compatible Green’s Function on Heterogeneous CPU-GPU Parallel Processing System

Implementation of Filtering Beamforming Algorithms for Sonar Devices Using GPU

Implementation of float-float operators on graphics hardware

Implementation of Frequency Domain Convolution for the Caffe-Framework

Implementation of high speed hash function Keccak on GPU

Implementation of Jacobi iterative method on graphics processor unit
Implementation of Just In Time Value Specialization for the Optimization of Data Parallel Kernels

Implementation of k-Means Clustering Algorithm in CUDA

Implementation of K-shortest Path Algorithm in GPU Using CUDA

Implementation of Kd-Trees on the GPU to Achieve Real Time Graphics Processing

Implementation of Keccak hash function in Tree hashing mode on Nvidia GPU

Implementation of Kernel Methods on the GPU

Implementation of Kirchhoff prestack depth migration on GPU

Implementation of large-scale FIR adaptive filters on NVIDIA GeForce graphics processing unit

Implementation of LTE Mini receiver on GPUs

Implementation of Massive Artificial Neural Networks with CUDA

Implementation of medical image segmentation in CUDA

Implementation of Motion Estimation Based on Heterogeneous Parallel Computing System with OpenCL

Implementation of Parallel Fast Hartley Transform (FHT) Using Cuda

Implementation of Parallel Genetic Algorithms on Graphics Processing Units

Implementation of Parallel Simplified Swarm Optimization in CUDA

Implementation of PDE models of cardiac dynamics on GPUs using OpenCL

Implementation of QR Updating Algorithms on the GPU

Implementation of random linear network coding on OpenGL-enabled graphics cards

Implementation of Sequential Importance Sampling in GPGPU

Implementation of Smith-Waterman Algorithm in OpenCL for GPUs

Implementation of Smith-Waterman algorithm in OpenCL for GPUs

Implementation of Spectral Angle Mapper (SAM) Algorithm on a Graphic processing unit (GPU)

Implementation of Stereo Matching Using High Level Compiler for Parallel Computing Acceleration

Implementation of stereophonic acoustic echo canceller on nVIDIA GeForce graphics processing unit

Implementation of the "Local Rank Differences" Image Feature Using SIMD Instructions of CPU

Implementation of the FDTD Method Based on Lorentz-Drude Dispersive Model on GPU for Plasmonics Applications

Titles: 100
open PDFs: 84
packages: 8
