Papers on hgpu.org (.txt-file)
Efficient Large-Scale Graph Processing on Hybrid CPU and GPU Systems

Efficient Large-Scale Language Model Training on GPU Clusters

Efficient LBM Visual Simulation on Face-Centered Cubic Lattices

Efficient linear-scaling quantum transport calculations on graphics processing units and applications on electron transport in graphene

Efficient lists intersection by CPU-GPU cooperative computing

Efficient magnetohydrodynamic simulations on graphics processing units with CUDA

Efficient Mapping of Streaming Applications for Image Processing on Graphics Cards

Efficient mapping of the training of Convolutional Neural Networks to a CUDA-based cluster

Efficient Matrix Factorization on Heterogeneous CPU-GPU Systems

Efficient MIMD architectures for high-performance ray tracing

Efficient Model-based 3D Tracking of Hand Articulations using Kinect

Efficient molecular dynamics simulations with many-body potentials on graphics processing units

Efficient Monte Carlo sampler for detecting parametric objects in large scenes

Efficient MPI-based Communication for GPU-Accelerated Dask Applications

Efficient Multi-GPU Algorithm for All-Pairs Shortest Paths

Efficient Multi-GPU Computation of All-Pairs Shortest Paths

Efficient Multiplication of Polynomials on Graphics Hardware

Efficient nearest-neighbor computation for GPU-based motion planning

Efficient Nearest-Neighbor Data Sharing in GPUs

Efficient Neural Network Acceleration on GPGPU using Content Addressable Memory

Efficient nonbonded interactions for molecular dynamics on a graphics processing unit

Efficient Numerical Evaluation of Feynman Integral

Efficient occupancy grid computation on the GPU with lidar and radar for road boundary detection

Efficient On-the-fly Category Retrieval using ConvNets and GPUs

Efficient OpenCL system integration of non-blocking FPGA accelerators

Efficient OpenCL-based concurrent tasks offloading on accelerators

Efficient PageRank and SpMV Computation on AMD GPUs

Efficient Parallel Algorithm for Nonlinear Dimensionality Reduction on GPU

Efficient parallel algorithms for maximum-density segment problem

Efficient Parallel and External Matching

Efficient Parallel CKY Parsing on GPUs

Efficient Parallel Evaluation of Multivariate Quadratic Polynomials on GPUs

Efficient Parallel Graph Exploration on Multi-Core CPU and GPU

Efficient Parallel Implementation for Single Block Orthogonal Dictionary Learning

Efficient Parallel Implementation of Active Appearance Model Fitting Algorithm on GPU

Efficient parallel implementation of the lattice Boltzmann method on large clusters of graphic processing units

Efficient Parallel Intra-prediction Mode Selection Scheme for 4×4 Blocks in H.264

Efficient parallel lists intersection and index compression algorithms using graphics processing units

Efficient Parallel Methods for Deep Reinforcement Learning

Efficient Parallel Nonnegative Least Squares on Multicore Architectures

Efficient Parallel Proximity Queries and an Application to Highly Complex Motion Planning Problems with Many Narrow Passages

Efficient Parallel RSA Decryption Algorithm for Many-core GPUs with CUDA

Efficient Parallel Scan Algorithms for GPUs

Efficient Parallel Strategy Improvement for Parity Games

Efficient Parallel Video Processing Techniques on GPU: From Framework to Implementation

Efficient Parallelization of Natural Language Applications using GPUs

Efficient Parallelization of Stochastic Simulation Algorithm for Chemically Reacting Systems on the Graphics Processing Unit

Efficient Parallelization of the Stochastic Simulation Algorithm for Chemically Reacting Systems On the Graphics Processing Unit

Efficient parallelized particle filter design on CUDA

Efficient Particle-Mesh Spreading on GPUs

Efficient Partitioning Based Hierarchical Agglomerative Clustering Using Graphics Accelerators with CUDA

Efficient partitioning of fragment shaders for multipass rendering on programmable graphics hardware

Efficient Password and Key recovery using Graphic Cards

Efficient Pattern-Based Time Series Classification on GPU

Efficient Performance Evaluation of Memory Hierarchy for Highly Multithreaded Graphics Processors

Efficient planar features matching for robot localization using GPU

Efficient Preconditioned Conjugate Gradient Parallelization on GPU

Efficient Probabilistic and Geometric Anatomical Mapping Using Particle Mesh Approximation on GPUs

Efficient Probabilistic Latent Semantic Indexing using Graphics Processing Unit

Efficient Probabilistic Model Checking on General Purpose Graphics Processors

Efficient Processing of MRFs for Unconstrained-Pose Face Recognition

Efficient pseudo-random number generation for monte-carlo simulations using graphic processors

Efficient pseudo-random number generators for biomolecular simulations on graphics processors

Efficient Quantized Sparse Matrix Operations on Tensor Cores

Efficient Query Processing in Co-Processor-accelerated Databases

Efficient Quicksort and 2D Convex Hull for CUDA, and MSIMD as a Realistic Model of Massively Parallel Computations

Efficient Radial Pattern Keyword Search on Knowledge Graphs in Parallel

Efficient Random Sampling – Parallel, Vectorized, Cache-Efficient, and Online

Efficient Rasterization for Outdoor Radio Wave Propagation

Efficient Ray Tracing of Dynamic Scenes on the GPU

Efficient Realization of Householder Transform through Algorithm-Architecture Co-design for Acceleration of QR Factorization

Efficient reconfigurable design for pricing asian options

Efficient reconstruction of biological networks via transitive reduction on general purpose graphics processors

Efficient Relational Algebra Algorithms and Data Structures for GPU

Efficient relational database management using graphics processors

Efficient Rendering of Scenes with Dynamic Lighting Using a Photons Queue and Incremental Update Algorithm

Efficient Resource Scheduling for Big Data Processing on Accelerator-based Heterogeneous Systems

Efficient Resource Sharing Through GPU Virtualization on Accelerated High Performance Computing Systems

Efficient scan-window based object detection using GPGPU

Efficient SDS Simulations on Multi-GPU Nodes of XSEDE High-end Clusters

Efficient Shadows for GPU-based Volume Raycasting

Efficient Shallow Water Simulations on GPUs

Efficient shallow water simulations on GPUs: Implementation, visualization, verification, and validation

Efficient SIMD Vectorization for Hashing in OpenCL

Efficient similarity search on multimedia databases

Efficient simulation of agent-based models on multi-GPU and multi-core clusters

Efficient Simulation of Fluid Flow and Transport in Heterogeneous Media Using Graphics Processing Units (GPUs)

Efficient simulation of large-scale spiking neural networks using CUDA graphics processors

Efficient Simulation of Ocean and Land Scenes Based on Digital Earth

Efficient Simulation Techniques for Large-Scale Applications

Efficient simulations of long wave propagation and runup using a LBM approach on GPGPU hardware

Efficient softmax approximation for GPUs

Efficient Sparse Matrix-Vector Multiplication on CUDA

Efficient Sparse Matrix-Vector Multiplication on GPUs using the CSR Storage Format

Efficient Sparse Matrix-Vector Multiplication on x86-Based Many-Core Processors

Efficient sparse voxel octrees

Efficient Sparse Voxel Octrees – Analysis, Extensions, and Implementation

Efficient Sparse-Dense Matrix-Matrix Multiplication on GPUs Using the Customized Sparse Storage Format

Efficient Spatial Anti-Aliasing Rendering for Line Joins on Vector Maps

Titles: 100
open PDFs: 95
packages: 16
