Papers on hgpu.org (.txt-file)
Impact of GPU Memory Access Patterns on FDTD

Impact of Modern OpenGL on FPS

Impact of the channel count on the nonlinear tolerance in coherently-detected POLMUX-QPSK modulation
Impact of Warp Formation on GPU Performance

Impacts of Parallel Programming on Limited-Resource Hardware

Implementability of shading models for current game engines
Implementation & Parallelisation of FDTD code for Electromagnetic Scattering

Implementation and Analysis of AES Encryption on GPU

Implementation and Evaluation of Recurrence Equation Solvers on GPGPU systems using Rearrangement of Array Configurations

Implementation and Evaluation of Scientific Simulations on High Performance Computing Architectures

Implementation and evaluation of various demons deformable image registration algorithms on GPU

Implementation and Experimental Evaluation of a CUDA Core under Single Event Effects

Implementation and Optimization of Image Processing Algorithms on Embedded GPU

Implementation and optimization of image processing algorithms on handheld GPU
Implementation and Performance Analysis of Many-body Quantum Chemical Methods on the Intel Xeon Phi Coprocessor and NVIDIA GPU Accelerator

Implementation and Performance Analysis of SEAL Encryption on FPGA, GPU and Multi-core Processors
Implementation and performance analysis of the AXPY, DOT, and SpMV functions on Intel Xeon Phi and NVIDIA Tesla using OpenCL

Implementation and performance evaluation of a GPU particle-in-cell code

Implementation and performance evaluation of reconstruction algorithms on graphics processors
Implementation Details of GPU-based Out-of-Core Many-Lights Rendering

Implementation of 2-D Discrete Cosine Transform Algorithm on GPU

Implementation of 3D FFTs Across Multiple GPUs in Shared Memory Environments

Implementation of 3D Monte Carlo PET reconstruction algorithm on GPU

Implementation of 802.11n on 128-CORE Processor

Implementation of a 3GPP LTE turbo decoder accelerator on GPU
Implementation of a distributed real-time video panorama pipeline for creating high quality virtual views

Implementation of a Fast Image Coding and Retrieval System Using a GPU

Implementation of a High Throughput 3GPP Turbo Decoder on GPU

Implementation of a High Throughput Soft MIMO Detector on GPU

Implementation of a Lattice Boltzmann kernel using the Compute Unified Device Architecture developed by nVIDIA

Implementation of a Lattice–Boltzmann method for numerical fluid mechanics using the nVIDIA CUDA technology

Implementation of a motion estimation algorithm for Intel FPGAs using OpenCL

Implementation of a Multi-User Detector for Satellite Return Links on a GPU Platform

Implementation of a multigrid solver on GPU for Stokes equations with strongly variable viscosity based on Matlab and CUDA

Implementation of a Parallel Tree Method on a GPU

Implementation of a PIC simulation using WebGL

Implementation of a Power Efficient Synthetic Aperture Radar Back Projection Algorithm on FPGAs Using OpenCL

Implementation of a Practical Distributed Calculation System with Browsers and JavaScript, and Application to Distributed Deep Learning

Implementation of a programming environment with a multithread model for reconfigurable systems
Implementation of a Soft Morphological Filter Based on GPU Framework
Implementation of Advanced Encryption Standard for encryption and decryption of images and text on a GPU

Implementation of algorithms for relativistic hydrodynamics using graphics processing units in CUDA framework

Implementation of algorithms with a fine-grained parallelism on GPUs
Implementation of Ant Colony Algorithm Based on GPU
Implementation of association rule mining using CUDA
Implementation of Autoencoders with Systolic Arrays through OpenCL

Implementation of Decoders for LDPC Block Codes and LDPC Convolutional Codes Based on GPUs

Implementation of Diamond Search Algorithm Using Parallel Processing Architecture

Implementation of digital down converter in GPU

Implementation of Fast Artificial Neural Network for Pattern Classification on Heterogeneous System

Implementation of FDTD-Compatible Green’s Function on Heterogeneous CPU-GPU Parallel Processing System

Implementation of Filtering Beamforming Algorithms for Sonar Devices Using GPU

Implementation of float-float operators on graphics hardware

Implementation of Frequency Domain Convolution for the Caffe-Framework

Implementation of high speed hash function Keccak on GPU

Implementation of Jacobi iterative method on graphics processor unit
Implementation of Just In Time Value Specialization for the Optimization of Data Parallel Kernels

Implementation of k-Means Clustering Algorithm in CUDA

Implementation of K-shortest Path Algorithm in GPU Using CUDA

Implementation of Kd-Trees on the GPU to Achieve Real Time Graphics Processing

Implementation of Keccak hash function in Tree hashing mode on Nvidia GPU

Implementation of Kernel Methods on the GPU

Implementation of Kirchhoff prestack depth migration on GPU

Implementation of large-scale FIR adaptive filters on NVIDIA GeForce graphics processing unit

Implementation of LTE Mini receiver on GPUs

Implementation of Massive Artificial Neural Networks with CUDA

Implementation of medical image segmentation in CUDA

Implementation of Motion Estimation Based on Heterogeneous Parallel Computing System with OpenCL

Implementation of Parallel Fast Hartley Transform (FHT) Using CUDA

Implementation of Parallel Genetic Algorithms on Graphics Processing Units

Implementation of Parallel Simplified Swarm Optimization in CUDA

Implementation of PDE models of cardiac dynamics on GPUs using OpenCL

Implementation of QR Updating Algorithms on the GPU

Implementation of random linear network coding on OpenGL-enabled graphics cards

Implementation of Sequential Importance Sampling in GPGPU

Implementation of Smith-Waterman Algorithm in OpenCL for GPUs

Implementation of Smith-Waterman algorithm in OpenCL for GPUs

Implementation of Spectral Angle Mapper (SAM) Algorithm on a Graphic processing unit (GPU)

Implementation of Stereo Matching Using High Level Compiler for Parallel Computing Acceleration

Implementation of stereophonic acoustic echo canceller on nVIDIA GeForce graphics processing unit

Implementation of the "Local Rank Differences" Image Feature Using SIMD Instructions of CPU

Implementation of the FDTD Method Based on Lorentz-Drude Dispersive Model on GPU for Plasmonics Applications

Implementation of the genetic algorithm by means of CUDA technology involved in travelling salesman problem

Implementation of the Lucas-Kanade image registration algorithm on a GPU for 3D computational platform stabilisation

Implementation of the Neuberger-Dirac operator on GPUs

Implementation of the optimization algorithms on GPGPU architecture and multi-cores

Implementation of the r.cuda.los module in the open source GRASS GIS by using parallel computation on the NVIDIA CUDA graphic cards

Implementation of the SYCL Heterogeneous Computing Library

Implementation of the twisted mass fermion operator in the QUDA library

Implementation of Variable Preconditioned GCR with mixed precision on GPU using CUDA
Implementation of Virtual Embryology using the Thrust library for CUDA

Implementation Techniques for SPMD Kernels on CPUs

Implementations of a Parallel Algorithm for Computing Euclidean Distance Map in Multicore Processors and GPUs

Implementations of hardware acceleration for MD4-family algorithms based on GPU
Implementations of Parallel Computation of Euclidean Distance Map in Multicore Processors and GPUs
Implementations of the FFT algorithm on GPU

Implementations of the Hough Transform on the Embedded Multicore Processors

Titles: 100
open PDFs: 82
packages: 7
