Papers on hgpu.org (.txt-file)
Exploring the acceleration of Nekbone on reconfigurable architectures
Exploring the Feasibility of Fully Homomorphic Encryption
Exploring The Latency and Bandwidth Tolerance of CUDA Applications
Exploring the Limits of Generic Code Execution on GPUs via Direct (OpenMP) Offload
Exploring the Limits of GPUs With Parallel Graph Algorithms
Exploring the Millennium Run – Scalable Rendering of Large-Scale Cosmological Datasets
Exploring the multiple-GPU design space
Exploring the Multitude of Real-Time Multi-GPU Configurations
Exploring the Optimization Space of Multi-Core Architectures with OpenCL Benchmarks
Exploring the power of GPU’s for training Deep Belief Networks
Exploring the Suitability of Remote GPGPU Virtualization for the OpenACC Programming Model Using rCUDA
Exploring the tradeoffs between programmability and efficiency in data-parallel accelerators
Exploring the use of glossy light volumes for interactive global illumination
Exploring Thread Coarsening on FPGA
Exploring Traditional and Emerging Parallel Programming Models using a Proxy Application
Exploring utilisation of GPU for database applications
Exploring weak scalability for FEM calculations on a GPU-enhanced cluster
Exponential integrators on graphic processing units
Exponential Integrators on Graphics Processing Units
Exposing Errors Related to Weak Memory in GPU Applications
Exposing Fine-Grained Parallelism in Algebraic Multigrid Methods
Exposing non-standard architectures to embedded software using compile-time virtualisation
Exposure Render: An Interactive Photo-Realistic Volume Rendering Framework
Expressed Sequence Tag Clustering using Commercial Gaming Hardware
Expressive Array Constructs in an Embedded GPU Kernel Programming Language
Extendable pattern-oriented optimization directives
Extendable Pattern-Oriented Optimization Directives (extended version)
Extended Data Collection: Analysis of Cache Behavior and Performance of Different BVH Memory Layouts for Tracing Incoherent Rays
Extended Dynamic Programming and Fast Multidimensional Search Algorithm for Energy Minization in Stereo and Motion
Extended-precision floating-point numbers for GPU computation
Extending a C-like Language for Portable SIMD Programming
Extending a Run-time Resource Management framework to support OpenCL and Heterogeneous Systems
Extending abstract GPU APIs to shared memory
Extending adaptive sparse grids for stochastic collocation to hybrid parallel architectures
Extending High-Level Synthesis for Task-Parallel Programs
Extending Lyapack for the Solution of Band Lyapunov Equations on Hybrid CPU-GPU Platforms
Extending MAGMA Portability with OneAPI
Extending OmpSs for OpenCL kernel co-execution in heterogeneous systems
Extending OmpSs to support CUDA and OpenCL in C, C++ and Fortran Applications
Extending Scala with General Purpose GPU Programming
Extending SYCL’s Programming Paradigm with Tensor-based SIMD Abstractions
Extending the Computational Application of Reaction-Diffusion Chemistry by Modelling Artificial Neural Networks
Extending the Generalized Fermat Prime Number Search Beyond One Million Digits Using GPUs
Extending the Gotran framework: LATEX and GPU acceleration
Extending the Scalability of Single Chip Stream Processors with On-chip Caches
Extending the SkelCL Skeleton Library for Stencil Computations on Multi-GPU Systems
Extension of the SkePU Skeleton Programming Framework for Multi-core CPU and Multi-GPU Systems for MPI-based Clusters
Extensions and Limitations of the Neural GPU
Extensions of Parallel Coordinates for Interactive Exploration of Large Multi-Timepoint Data Sets
Extinction-Based Shading and Illumination in GPU Volume Ray-Casting
Extracting Flow Features Using Bag-of-Features and Supervised Learning Techniques
Extracting Maximal Exact Matches on GPU
Extremely fast simulator for decoding LDPC codes
Extremely large scale simulation of a Kardar-Parisi-Zhang model using graphics cards
Eye-Full Tower: A GPU-based variable multibaseline omnidirectional stereovision system with automatic baseline selection for outdoor mobile robot navigation
Face Detection CUDA Accelerating
Face Detection for Human Identification in Surveillance
Face Detection with Improved Local Binary Patterns in CUDA
Face Recognition with Hybrid Efficient Convolution Algorithms on FPGAs
Face Recognition: A Tutorial on Computational Aspects
Face Retriever: Pre-filtering the Gallery via Deep Neural Net
Face Search at Scale: 80 Million Gallery
Face.evoLVe: A High-Performance Face Recognition Library
Facial Expression Recognition – Review
Facial Recognition Using Neural Networks over GPGPU
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Falcon: A Graph Manipulation Language for Heterogeneous Systems
FAMOUS, faster: using parallel computing techniques to accelerate the FAMOUS/HadCM3 climate model with a focus on the radiative transfer algorithm
Fancier: A Unified Framework for Java, C, and OpenCL Integration
FANN-on-MCU: An Open-Source Toolkit for Energy-Efficient Neural Network Inference at the Edge of the Internet of Things
FANS: FPGA-Accelerated Near-Storage Sorting
FARGO3D: A new GPU-oriented MHD code
Fast 2-D Ultrasound Strain Imaging: The Benefits of Using a GPU
Fast 2D-3D registration using GPU-based preprocessing
Fast 3D Graphics Rendering Technique with CUDA Parallel Processing
Fast 3D Salient Region Detection in Medical Images using GPUs
Fast 3D Structure Localization in Medical Volumes using CUDA-enabled GPUs
Fast 3D Wavelet Transform on Multicore and Manycore Computing Platforms
Fast 4D Sheared Filtering for Interactive Rendering of Distribution Effects
Fast 4pi track reconstruction in nuclear emulsion detectors based on GPU technology
Fast Acceleration of 2D Wave Propagation Simulations Using Modern Computational Accelerators
Fast acoustic computations using graphics processors
Fast Adaptive Sampling Technique for Multi-Dimensional Integral Estimation Using GPUs
Fast algorithm of ray tracing based on KD-tree structure
Fast algorithms and efficient GPU implementations for the Radon transform and the back-projection operator represented as convolution operators
Fast Algorithms for Convolutional Neural Networks
Fast Algorithms for the Solution of Stochastic Partial Differential Equations
Fast American Basket Option Pricing on a multi-GPU Cluster
Fast Analysis of Molecular Dynamics Trajectories with Graphics Processing Units – Radial Distribution Function Histogramming
Fast analysis of molecular dynamics trajectories with graphics processing units-Radial distribution function histogramming
Fast analytical modeling of compton scatter using point clouds and graphics processing unit (GPU)
Fast and accurate digital signal processing realized with GPGPU technology
Fast and Accurate Finite-Element Multigrid Solvers for PDE Simulations on GPU Clusters
Fast and Accurate Generalized Harmonic Analysis and Its Parallel Computation by GPU
Fast and accurate PIV computation using highly parallel iterative correlation maximization
Fast and Accurate Poisson Denoising with Optimized Nonlinear Diffusion
Fast and accurate protein substructure searching with simulated annealing and GPUs
Titles: 100
open PDFs: 97
packages: 20