Papers on hgpu.org (.txt-file)
Efficient Implementation of the eta_T Pairing on GPU
Efficient implementation of the overlap operator on multi-GPUs
Efficient Implementation of the Simplex Method on a CPU-GPU System
Efficient Incremental Text-to-Speech on GPUs
Efficient Independent Component Analysis on a GPU
Efficient Inference For Neural Machine Translation
Efficient Integral Image Computation on the GPU
Efficient Interleaved Batch Matrix Solvers for CUDA
Efficient Intranode Communication in GPU-Accelerated Systems
Efficient Irregular Wavefront Propagation Algorithms on Hybrid CPU-GPU Machines
Efficient JPEG2000 EBCOT Context Modeling for Massively Parallel Architectures
Efficient Kernel Fusion Techniques for Massive Video Data Analysis on GPGPUs
Efficient Kernel Synthesis for Performance Portable Programming
Efficient Knowledge Extraction from Structured Data
Efficient Large-scale Approximate Nearest Neighbor Search on OpenCL FPGA
Efficient Large-scale Approximate Nearest Neighbor Search on the GPU
Efficient Large-Scale Graph Processing on Hybrid CPU and GPU Systems
Efficient Large-Scale Language Model Training on GPU Clusters
Efficient LBM Visual Simulation on Face-Centered Cubic Lattices
Efficient linear-scaling quantum transport calculations on graphics processing units and applications on electron transport in graphene
Efficient lists intersection by CPU-GPU cooperative computing
Efficient magnetohydrodynamic simulations on graphics processing units with CUDA
Efficient Mapping of Streaming Applications for Image Processing on Graphics Cards
Efficient mapping of the training of Convolutional Neural Networks to a CUDA-based cluster
Efficient Matrix Factorization on Heterogeneous CPU-GPU Systems
Efficient MIMD architectures for high-performance ray tracing
Efficient Model-based 3D Tracking of Hand Articulations using Kinect
Efficient molecular dynamics simulations with many-body potentials on graphics processing units
Efficient Monte Carlo sampler for detecting parametric objects in large scenes
Efficient MPI-based Communication for GPU-Accelerated Dask Applications
Efficient Multi-GPU Algorithm for All-Pairs Shortest Paths
Efficient Multi-GPU Computation of All-Pairs Shortest Paths
Efficient Multiplication of Polynomials on Graphics Hardware
Efficient nearest-neighbor computation for GPU-based motion planning
Efficient Nearest-Neighbor Data Sharing in GPUs
Efficient Neural Network Acceleration on GPGPU using Content Addressable Memory
Efficient nonbonded interactions for molecular dynamics on a graphics processing unit
Efficient Numerical Evaluation of Feynman Integral
Efficient occupancy grid computation on the GPU with lidar and radar for road boundary detection
Efficient On-the-fly Category Retrieval using ConvNets and GPUs
Efficient OpenCL system integration of non-blocking FPGA accelerators
Efficient OpenCL-based concurrent tasks offloading on accelerators
Efficient PageRank and SpMV Computation on AMD GPUs
Efficient Parallel Algorithm for Nonlinear Dimensionality Reduction on GPU
Efficient parallel algorithms for maximum-density segment problem
Efficient Parallel and External Matching
Efficient Parallel CKY Parsing on GPUs
Efficient Parallel Evaluation of Multivariate Quadratic Polynomials on GPUs
Efficient Parallel Graph Exploration on Multi-Core CPU and GPU
Efficient Parallel Implementation for Single Block Orthogonal Dictionary Learning
Efficient Parallel Implementation of Active Appearance Model Fitting Algorithm on GPU
Efficient parallel implementation of the lattice Boltzmann method on large clusters of graphic processing units
Efficient Parallel Intra-prediction Mode Selection Scheme for 4×4 Blocks in H.264
Efficient parallel lists intersection and index compression algorithms using graphics processing units
Efficient Parallel Methods for Deep Reinforcement Learning
Efficient Parallel Nonnegative Least Squares on Multicore Architectures
Efficient Parallel Proximity Queries and an Application to Highly Complex Motion Planning Problems with Many Narrow Passages
Efficient Parallel RSA Decryption Algorithm for Many-core GPUs with CUDA
Efficient Parallel Scan Algorithms for GPUs
Efficient Parallel Strategy Improvement for Parity Games
Efficient Parallel Video Processing Techniques on GPU: From Framework to Implementation
Efficient Parallelization of Natural Language Applications using GPUs
Efficient Parallelization of Stochastic Simulation Algorithm for Chemically Reacting Systems on the Graphics Processing Unit
Efficient Parallelization of the Stochastic Simulation Algorithm for Chemically Reacting Systems On the Graphics Processing Unit
Efficient parallelized particle filter design on CUDA
Efficient Particle-Mesh Spreading on GPUs
Efficient Partitioning Based Hierarchical Agglomerative Clustering Using Graphics Accelerators with CUDA
Efficient partitioning of fragment shaders for multipass rendering on programmable graphics hardware
Efficient Password and Key recovery using Graphic Cards
Efficient Pattern-Based Time Series Classification on GPU
Efficient Performance Evaluation of Memory Hierarchy for Highly Multithreaded Graphics Processors
Efficient planar features matching for robot localization using GPU
Efficient Preconditioned Conjugate Gradient Parallelization on GPU
Efficient Probabilistic and Geometric Anatomical Mapping Using Particle Mesh Approximation on GPUs
Efficient Probabilistic Latent Semantic Indexing using Graphics Processing Unit
Efficient Probabilistic Model Checking on General Purpose Graphics Processors
Efficient Processing of MRFs for Unconstrained-Pose Face Recognition
Efficient pseudo-random number generation for monte-carlo simulations using graphic processors
Efficient pseudo-random number generators for biomolecular simulations on graphics processors
Efficient Quantized Sparse Matrix Operations on Tensor Cores
Efficient Query Processing in Co-Processor-accelerated Databases
Efficient Quicksort and 2D Convex Hull for CUDA, and MSIMD as a Realistic Model of Massively Parallel Computations
Efficient Radial Pattern Keyword Search on Knowledge Graphs in Parallel
Efficient Random Sampling – Parallel, Vectorized, Cache-Efficient, and Online
Efficient Rasterization for Outdoor Radio Wave Propagation
Efficient Ray Tracing of Dynamic Scenes on the GPU
Efficient Realization of Householder Transform through Algorithm-Architecture Co-design for Acceleration of QR Factorization
Efficient reconfigurable design for pricing asian options
Efficient reconstruction of biological networks via transitive reduction on general purpose graphics processors
Efficient Relational Algebra Algorithms and Data Structures for GPU
Efficient relational database management using graphics processors
Efficient Rendering of Scenes with Dynamic Lighting Using a Photons Queue and Incremental Update Algorithm
Efficient Resource Scheduling for Big Data Processing on Accelerator-based Heterogeneous Systems
Efficient Resource Sharing Through GPU Virtualization on Accelerated High Performance Computing Systems
Efficient scan-window based object detection using GPGPU
Efficient SDS Simulations on Multi-GPU Nodes of XSEDE High-end Clusters
Efficient Shadows for GPU-based Volume Raycasting
Efficient Shallow Water Simulations on GPUs
Efficient shallow water simulations on GPUs: Implementation, visualization, verification, and validation
Titles: 100
open PDFs: 93
packages: 14