Papers on hgpu.org (.txt-file)
Identifying the Key Features of Intel Xeon Phi: A Comparative Approach
IgNet. A Super-precise Convolutional Neural Network
Ignite-GPU: a GPU-enabled in-memory computing architecture on clusters
iGniter: Interference-Aware GPU Resource Provisioning for Predictable DNN Inference in the Cloud
iGPU: Exception Support and Speculative Execution on GPUs
iGUARD: In-GPU Advanced Race Detection
Ikra-Cpp: A C++/CUDA DSL for Object-Oriented Programming with Structure-of-Arrays Layout
Ilargi: a GPU Compatible Factorized ML Model Training Framework
Illustrative Rendering of Particle Systems
Illustrative Volume Visualization Using GPU-Based Particle Systems
Image and Video Processing on CUDA: State of the Art and Future Directions
Image and Video Processing on GPU: Implementation Scheme, Applications and Future Directions
Image Classification with Pyramid Representation and Rotated Data Augmentation on Torch 7
Image Convolution Processing: a GPU versus FPGA Comparison
Image Denoising Using Wavelet Transform and CUDA
Image Encryption Using Parallel RSA Algorithm on CUDA
Image Noise Removal on Heterogeneous CPU-GPU Configurations
Image Object Tracking System Using Parallel Mean Shift Algorithm
Image parallel processing based on GPU
Image processing algorithm optimization with CUDA for Pure Data
Image processing applications on a low power highly parallel SIMD architecture
Image Processing on Graphical Processing Units for faster DNA Sequencing
Image Processing using Parallel Computing
Image reconstruction in digital holographic microscopy on GPU
Image representation by blob and its application in CT reconstruction from few projections
Image segmentation using CUDA implementations of the Runge-Kutta-Merson and GMRES methods
Image selection for improved Multi-View Stereo
Image spatial diffusion on GPUs
Image super-resolution by vectorizing edges
Image Super-Resolution Using Deep Convolutional Networks
Image-based fast three-dimensional leaf modeling
Image-Based Material Restyling with Fast Non-local Means Filtering
Image-Based Proxy Accumulation for Real-Time Soft Global Illumination
Image-Space Caustics and Curvatures
Image-Space Collision Detection Through Alternate Surface Peeling
Image-Space GPU Metaballs for Time-Dependent Particle Data Sets
ImageCL: An Image Processing Language for Performance Portability on Heterogeneous Systems
ImageCL: Language and source-to-source compiler for performance portability, load balancing, and scalability prediction on heterogeneous systems
Impact of asynchronism on GPU accelerated parallel iterative computations
Impact of communication times on mixed CPU/GPU applications scheduling using KAAPI
Impact of data layouts on the efficiency of GPU-accelerated IDW interpolation
Impact of Floating-Point Precision on Boundary Layer Instabilities Modeled on Fermi GPU
Impact of GPU Memory Access Patterns on FDTD
Impact of Modern OpenGL on FPS
Impact of the channel count on the nonlinear tolerance in coherently-detected POLMUX-QPSK modulation
Impact of Warp Formation on GPU Performance
Impacts of Parallel Programming on Limited-Resource Hardware
Implementability of shading models for current game engines
Implementation & Parallelisation of FDTD code for Electromagnetic Scattering
Implementation and Analysis of AES Encryption on GPU
Implementation and Evaluation of Recurrence Equation Solvers on GPGPU systems using Rearrangement of Array Configurations
Implementation and Evaluation of Scientific Simulations on High Performance Computing Architectures
Implementation and evaluation of various demons deformable image registration algorithms on GPU
Implementation and Experimental Evaluation of a CUDA Core under Single Event Effects
Implementation and Optimization of Image Processing Algorithms on Embedded GPU
Implementation and optimization of image processing algorithms on handheld GPU
Implementation and Performance Analysis of Many-body Quantum Chemical Methods on the Intel Xeon Phi Coprocessor and NVIDIA GPU Accelerator
Implementation and Performance Analysis of SEAL Encryption on FPGA, GPU and Multi-core Processors
Implementation and performance analysis of the AXPY, DOT, and SpMV functions on Intel Xeon Phi and NVIDIA Tesla using OpenCL
Implementation and performance evaluation of a GPU particle-in-cell code
Implementation and performance evaluation of reconstruction algorithms on graphics processors
Implementation Details of GPU-based Out-of-Core Many-Lights Rendering
Implementation of 2-D Discrete Cosine Transform Algorithm on GPU
Implementation of 3D FFTs Across Multiple GPUs in Shared Memory Environments
Implementation of 3D Monte Carlo PET reconstruction algorithm on GPU
Implementation of 802.11n on 128-CORE Processor
Implementation of a 3GPP LTE turbo decoder accelerator on GPU
Implementation of a distributed real-time video panorama pipeline for creating high quality virtual views
Implementation of a Fast Image Coding and Retrieval System Using a GPU
Implementation of a High Throughput 3GPP Turbo Decoder on GPU
Implementation of a High Throughput Soft MIMO Detector on GPU
Implementation of a Lattice Boltzmann kernel using the Compute Unified Device Architecture developed by nVIDIA
Implementation of a Lattice–Boltzmann method for numerical fluid mechanics using the nVIDIA CUDA technology
Implementation of a motion estimation algorithm for Intel FPGAs using OpenCL
Implementation of a Multi-User Detector for Satellite Return Links on a GPU Platform
Implementation of a multigrid solver on GPU for Stokes equations with strongly variable viscosity based on Matlab and CUDA
Implementation of a Parallel Tree Method on a GPU
Implementation of a PIC simulation using WebGL
Implementation of a Power Efficient Synthetic Aperture Radar Back Projection Algorithm on FPGAs Using OpenCL
Implementation of a Practical Distributed Calculation System with Browsers and JavaScript, and Application to Distributed Deep Learning
Implementation of a programming environment with a multithread model for reconfigurable systems
Implementation of a Soft Morphological Filter Based on GPU Framework
Implementation of Advanced Encryption Standard for encryption and decryption of images and text on a GPU
Implementation of algorithms for relativistic hydrodynamics using graphics processing units in CUDA framework
Implementation of algorithms with a fine-grained parallelism on GPUs
Implementation of Ant Colony Algorithm Based on GPU
Implementation of association rule mining using CUDA
Implementation of Autoencoders with Systolic Arrays through OpenCL
Implementation Of Decoders for LDPC Block Codes and LDPC Convolutional Codes Based on GPUs
Implementation of Diamond Search Algorithm Using Parallel Processing Architecture
Implementation of digital down converter in GPU
Implementation of Fast Artificial Neural Network for Pattern Classification on Heterogeneous System
Implementation of FDTD-Compatible Green’s Function on Heterogeneous CPU-GPU Parallel Processing System
Implementation of Filtering Beamforming Algorithms for Sonar Devices Using GPU
Titles: 100
open PDFs: 82
packages: 11