Papers on hgpu.org (.txt-file)
Implementation of PDE models of cardiac dynamics on GPUs using OpenCL
Implementation of QR Updating Algorithms on the GPU
Implementation of random linear network coding on OpenGL-enabled graphics cards
Implementation of Sequential Importance Sampling in GPGPU
Implementation of Smith-Waterman Algorithm in OpenCL for GPUs
Implementation of Smith-Waterman algorithm in OpenCL for GPUs
Implementation of Spectral Angle Mapper (SAM) Algorithm on a Graphic processing unit (GPU)
Implementation of Stereo Matching Using High Level Compiler for Parallel Computing Acceleration
Implementation of stereophonic acoustic echo canceller on nVIDIA GeForce graphics processing unit
Implementation of the "Local Rank Differences" Image Feature Using SIMD Instructions of CPU
Implementation of the FDTD Method Based on Lorentz-Drude Dispersive Model on GPU for Plasmonics Applications
Implementation of the genetic algorithm by means of CUDA technology involved in travelling salesman problem
Implementation of the Lucas-Kanade image registration algorithm on a GPU for 3D computational platform stabilisation
Implementation of the Neuberger-Dirac operator on GPUs
Implementation of the optimization algorithms on GPGPU architecture and multi-cores
Implementation of the r.cuda.los module in the open source GRASS GIS by using parallel computation on the NVIDIA CUDA graphic cards
Implementation of the SYCL Heterogeneous Computing Library
Implementation of the twisted mass fermion operator in the QUDA library
Implementation of Variable Preconditioned GCR with mixed precision on GPU using CUDA
Implementation of Virtual Embryology using the Thrust library for CUDA
Implementation Techniques for SPMD Kernels on CPUs
Implementations of a Parallel Algorithm for Computing Euclidean Distance Map in Multicore Processors and GPUs
Implementations of hardware acceleration for MD4-family algorithms based on GPU
Implementations of Parallel Computation of Euclidean Distance Map in Multicore Processors and GPUs
Implementations of the FFT algorithm on GPU
Implementations of the Hough Transform on the Embedded Multicore Processors
Implementing a Code Generator for Fast Matrix Multiplication in OpenCL on the GPU
Implementing a Finite Difference-Based Real-time Sound Synthesizer using GPUs
Implementing a GPU Programming Model on a non-GPU Accelerator Architecture
Implementing a GPU-Enhanced Cluster for Large-Scale Simulations
Implementing a Photorealistic Rendering System using GLSL
Implementing a Preconditioned Iterative Linear Solver Using Massively Parallel Graphics Processing Units
Implementing a Sparse Matrix Vector Product for the SELL-C/SELL-C-sigma formats on NVIDIA GPUs
Implementing AES on GPU: Final Report
Implementing an architecture for efficient network traffic processing on modern graphics hardware
Implementing an efficient method of check-pointing on CPU-GPU
Implementing an embedded GPU language by combining translation and generation
Implementing an Interior Point Method for Linear Programs on a CPU-GPU System
Implementing and evaluating an heterogeneous, scalable, tridiagonal linear system solver with OpenCL to target FPGAs, GPUs, and CPUs
Implementing and Evaluating Candidate-Based Invariant Generation
Implementing cartesian genetic programming classifiers on graphics processing units using GPU.NET
Implementing CFD (Computational Fluid Dynamics) in OpenCL for Building Simulation
Implementing Computer Vision Functions with OpenCL on the Qualcomm Adreno 420
Implementing Continuous Integration Software in an Established Computational Chemistry Software Package
Implementing Decision Trees and Forests on a GPU
Implementing Deep Neural Networks for Financial Market Prediction on the Intel Xeon Phi
Implementing density functional theory (DFT) methods on many-core GPGPU accelerators
Implementing Domain-Specific Languages for Heterogeneous Parallel Computing
Implementing Efficient, Portable Computations for Machine Learning
Implementing general matrix-matrix multiplication algorithm on the Intel Xeon Phi Knights Landing Processor
Implementing Genetic Algorithms to CUDA Environment Using Data Parallelization
Implementing implicit OpenMP data sharing on GPUs
Implementing Independent Component Analysis in General-Purpose GPU Architectures
Implementing Interactive 3D Segmentation on CUDA Using Graph-Cuts and Watershed Transformation
Implementing Level-3 BLAS Routines in OpenCL on Different Processing Units
Implementing LNS using filtering units of GPUs
Implementing Machine Learning Algorithms on GPUs for Real-Time Traffic Sign Classification
Implementing mesh-based approaches for deformable objects on GPU
Implementing modular arithmetic using OpenCL
Implementing Molecular Dynamics on Hybrid High Performance Computers – Particle-Particle Particle-Mesh
Implementing molecular dynamics on hybrid high performance computers – short range forces
Implementing Molecular Dynamics on Hybrid High Performance Computers – Three-Body Potentials
Implementing Neural Networks Efficiently
Implementing Open-Source CUDA Runtime
Implementing Parallel SMO to Train SVM on CUDA-Enabled Systems
Implementing Push-Pull Efficiently in GraphBLAS
Implementing QR Factorization Updating Algorithms on GPUs
Implementing sparse matrix-vector multiplication on throughput-oriented processors
Implementing Sparse Matrix-Vector multiplication using CUDA based on a hybrid sparse matrix format
Implementing Sparse Matrix-Vector Multiplication with QCSR on GPU
Implementing Stereo Vision of GPU-Accelerated Scientific Simulations using Commodity Hardware
Implementing Strassen’s Algorithm with CUTLASS on NVIDIA Volta GPUs
Implementing the Approximate Message Passing (AMP) Algorithm on a GPU
Implementing the Himeno benchmark with CUDA on GPU clusters
Implementing the PGI Accelerator model
Implementing the Projected Spatial Rich Features on a GPU
Implementing Ultrasound Beamforming on the GPU using CUDA
Implications of the Turing completeness of reaction-diffusion models, informed by GPGPU simulations on an XBox 360: cardiac arrhythmias, re-entry and the Halting problem
Implicit Adaptive Volume Ray Casting
Implicit and dynamic trees for high performance rendering
Implicit Boundary Control of Vector Field Based Shape Deformations
Implicit Feature-Based Alignment System for Radiotherapy
Implicit Methods for Real-Time simulation of Interactive Waves
Implicit Parallel Time Integrators
Implicit Skinning: Real-Time Skin Deformation with Contact Modeling
Importance of Data Loading Pipeline in Training Deep Neural Networks
Importance of Explicit Vectorization for CPU and GPU Software Performance
Importance Point Projection for GPU-based Final Gathering
Importance sampling algorithms for first passage time probabilities in the infinite server queue
Importance Sampling of Realistic Light Sources
Importance-driven compositing window management
Importance-Driven Isosurface Decimation for Visualization of Large Simulation Data Based on OpenCL
Importance-Driven Particle Techniques for Flow Visualization
Impostors and pseudo-instancing for GPU crowd rendering
Impostors, Pseudo-instancing and Image Maps for GPU Crowd Rendering
Improved automated lattice perturbation theory in background field gauge
Improved Distance Weighted GPU-based 3D Ultrasound Reconstruction Methods
Improved FCM algorithm for Clustering on Web Usage Mining
Improved Finite Difference Schemes for a 3-D Viscothermal Wave Equation on a GPU
Titles: 100
open PDFs: 88
packages: 14