Papers on hgpu.org (.txt-file)
Efficient Exploitation of Heterogeneous Platforms for Vertebra Detection in X-Ray Images
Efficient fault simulation on many-core processors
Efficient FFT mapping on GPU for radar processing application: modeling and implementation
Efficient fine grained shared buffer management for multiple OpenCL devices
Efficient Finite Element Geometric Multigrid Solvers for Unstructured Grids on GPUs
Efficient floating-point texture decompression
Efficient fMRI Analysis and Clustering on GPUs
Efficient gather and scatter operations on graphics processors
Efficient Geometry Compression for GPU-based Decoding in Realtime Terrain Rendering
Efficient GPGPU-based parallel packet classification
Efficient GPU Implementation for Particle in Cell Algorithm
Efficient GPU Implementation for Single Block Orthogonal Dictionary Learning
Efficient GPU implementation of a class of array permutations
Efficient GPU implementation of a two waves WAF method for the two-dimensional one layer Shallow Water system on structured meshes
Efficient GPU implementation of parameter estimation of a statistical model for online advertisement optimization
Efficient GPU implementation of the integral histogram
Efficient GPU-Accelerated Elastic Image Registration
Efficient GPU-based Construction of Occupancy Girds Using several Laser Range-finders
Efficient GPU-based Graph Cuts for Stereo Matching
Efficient GPU-Based Texture Interpolation using Uniform B-Splines
Efficient GPU-based Training of Recurrent Neural Network Language Models Using Spliced Sentence Bunch
Efficient GPU-Implementation of Adaptive Mesh Refinement for the Shallow-Water Equations
Efficient gradient-domain compositing using quadtrees
Efficient Graph Comparison and Visualization Using GPU
Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration
Efficient Hardware Acceleration on SoC-FPGA with OpenCL
Efficient Hash Tables on the GPU
Efficient Heterogeneous Execution on Large Multicore and Accelerator Platforms: Case Study Using a Block Tridiagonal Solver
Efficient heterogeneous matrix profile on a CPU + High Performance FPGA with integrated HBM
Efficient hierarchical parallel genetic algorithms using grid computing
Efficient High-Quality Volume Rendering of SPH Data
Efficient High-Speed WPA2 Brute Force Attacks using Scalable Low-Cost FPGA Clustering
Efficient Hybrid Execution of C++ Applications using Intel(R) Xeon Phi(TM) Coprocessor
Efficient image reconstruction for point-based and line-based rendering
Efficient Implementation and Evaluation of Methods for the Estimation of Motion in Image Sequences
Efficient Implementation and Optimization of Geometric Multigrid Operations in the LIFT Framework
Efficient implementation for MD5-RC4 encryption using GPU with CUDA
Efficient implementation for QUAD stream cipher with GPUs
Efficient Implementation of Bi-directional Path Tracer on GPU
Efficient implementation of computationally intensive algorithms on parallel computing platforms
Efficient implementation of data flow graphs on multi-gpu clusters
Efficient implementation of GPGPU synchronization primitives on CPUs
Efficient Implementation of Hyperspectral Anomaly Detection Techniques on GPUs and Multicore Processors
Efficient Implementation of MrBayes on multi-GPU
Efficient implementation of multiuser precoding algorithms on GPU for MIMO-OFDM systems
Efficient Implementation of Optical Flow Algorithm Based on Directional Filters on a GPU Using CUDA
Efficient Implementation of RLS-Based Adaptive Filters on nVIDIA GeForce Graphics Processing Unit
Efficient Implementation of the CPR Formulation for the Navier-Stokes Equations on GPUs
Efficient Implementation of the eta_T Pairing on GPU
Efficient implementation of the overlap operator on multi-GPUs
Efficient Implementation of the Simplex Method on a CPU-GPU System
Efficient Incremental Text-to-Speech on GPUs
Efficient Independent Component Analysis on a GPU
Efficient Inference For Neural Machine Translation
Efficient Integral Image Computation on the GPU
Efficient Interleaved Batch Matrix Solvers for CUDA
Efficient Intranode Communication in GPU-Accelerated Systems
Efficient Irregular Wavefront Propagation Algorithms on Hybrid CPU-GPU Machines
Efficient JPEG2000 EBCOT Context Modeling for Massively Parallel Architectures
Efficient Kernel Fusion Techniques for Massive Video Data Analysis on GPGPUs
Efficient Kernel Synthesis for Performance Portable Programming
Efficient Knowledge Extraction from Structured Data
Efficient Large-scale Approximate Nearest Neighbor Search on OpenCL FPGA
Efficient Large-scale Approximate Nearest Neighbor Search on the GPU
Efficient Large-Scale Graph Processing on Hybrid CPU and GPU Systems
Efficient Large-Scale Language Model Training on GPU Clusters
Efficient LBM Visual Simulation on Face-Centered Cubic Lattices
Efficient linear-scaling quantum transport calculations on graphics processing units and applications on electron transport in graphene
Efficient lists intersection by CPU-GPU cooperative computing
Efficient magnetohydrodynamic simulations on graphics processing units with CUDA
Efficient Mapping of Streaming Applications for Image Processing on Graphics Cards
Efficient mapping of the training of Convolutional Neural Networks to a CUDA-based cluster
Efficient Matrix Factorization on Heterogeneous CPU-GPU Systems
Efficient MIMD architectures for high-performance ray tracing
Efficient Model-based 3D Tracking of Hand Articulations using Kinect
Efficient molecular dynamics simulations with many-body potentials on graphics processing units
Efficient Monte Carlo sampler for detecting parametric objects in large scenes
Efficient MPI-based Communication for GPU-Accelerated Dask Applications
Efficient Multi-GPU Algorithm for All-Pairs Shortest Paths
Efficient Multi-GPU Computation of All-Pairs Shortest Paths
Efficient Multiplication of Polynomials on Graphics Hardware
Efficient nearest-neighbor computation for GPU-based motion planning
Efficient Nearest-Neighbor Data Sharing in GPUs
Efficient Neural Network Acceleration on GPGPU using Content Addressable Memory
Efficient nonbonded interactions for molecular dynamics on a graphics processing unit
Efficient Numerical Evaluation of Feynman Integral
Efficient occupancy grid computation on the GPU with lidar and radar for road boundary detection
Efficient On-the-fly Category Retrieval using ConvNets and GPUs
Efficient OpenCL system integration of non-blocking FPGA accelerators
Efficient OpenCL-based concurrent tasks offloading on accelerators
Efficient PageRank and SpMV Computation on AMD GPUs
Efficient Parallel Algorithm for Nonlinear Dimensionality Reduction on GPU
Efficient parallel algorithms for maximum-density segment problem
Efficient Parallel and External Matching
Efficient Parallel CKY Parsing on GPUs
Efficient Parallel Evaluation of Multivariate Quadratic Polynomials on GPUs
Efficient Parallel Graph Exploration on Multi-Core CPU and GPU
Efficient Parallel Implementation for Single Block Orthogonal Dictionary Learning
Efficient Parallel Implementation of Active Appearance Model Fitting Algorithm on GPU
Titles: 100
open PDFs: 91
packages: 14