Papers on hgpu.org (.txt-file)
Exposure Render: An Interactive Photo-Realistic Volume Rendering Framework
Expressed Sequence Tag Clustering using Commercial Gaming Hardware
Expressive Array Constructs in an Embedded GPU Kernel Programming Language
Extendable pattern-oriented optimization directives
Extendable Pattern-Oriented Optimization Directives (extended version)
Extended Data Collection: Analysis of Cache Behavior and Performance of Different BVH Memory Layouts for Tracing Incoherent Rays
Extended Dynamic Programming and Fast Multidimensional Search Algorithm for Energy Minimization in Stereo and Motion
Extended-precision floating-point numbers for GPU computation
Extending a C-like Language for Portable SIMD Programming
Extending a Run-time Resource Management framework to support OpenCL and Heterogeneous Systems
Extending abstract GPU APIs to shared memory
Extending adaptive sparse grids for stochastic collocation to hybrid parallel architectures
Extending High-Level Synthesis for Task-Parallel Programs
Extending Lyapack for the Solution of Band Lyapunov Equations on Hybrid CPU-GPU Platforms
Extending MAGMA Portability with OneAPI
Extending OmpSs for OpenCL kernel co-execution in heterogeneous systems
Extending OmpSs to support CUDA and OpenCL in C, C++ and Fortran Applications
Extending Scala with General Purpose GPU Programming
Extending SYCL’s Programming Paradigm with Tensor-based SIMD Abstractions
Extending the Computational Application of Reaction-Diffusion Chemistry by Modelling Artificial Neural Networks
Extending the Generalized Fermat Prime Number Search Beyond One Million Digits Using GPUs
Extending the Gotran framework: LATEX and GPU acceleration
Extending the Scalability of Single Chip Stream Processors with On-chip Caches
Extending the SkelCL Skeleton Library for Stencil Computations on Multi-GPU Systems
Extension of the SkePU Skeleton Programming Framework for Multi-core CPU and Multi-GPU Systems for MPI-based Clusters
Extensions and Limitations of the Neural GPU
Extensions of Parallel Coordinates for Interactive Exploration of Large Multi-Timepoint Data Sets
Extinction-Based Shading and Illumination in GPU Volume Ray-Casting
Extracting Flow Features Using Bag-of-Features and Supervised Learning Techniques
Extracting Maximal Exact Matches on GPU
Extremely fast simulator for decoding LDPC codes
Extremely large scale simulation of a Kardar-Parisi-Zhang model using graphics cards
Eye-Full Tower: A GPU-based variable multibaseline omnidirectional stereovision system with automatic baseline selection for outdoor mobile robot navigation
Face Detection CUDA Accelerating
Face Detection for Human Identification in Surveillance
Face Detection with Improved Local Binary Patterns in CUDA
Face Recognition with Hybrid Efficient Convolution Algorithms on FPGAs
Face Recognition: A Tutorial on Computational Aspects
Face Retriever: Pre-filtering the Gallery via Deep Neural Net
Face Search at Scale: 80 Million Gallery
Face.evoLVe: A High-Performance Face Recognition Library
Facial Expression Recognition – Review
Facial Recognition Using Neural Networks over GPGPU
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Falcon: A Graph Manipulation Language for Heterogeneous Systems
FAMOUS, faster: using parallel computing techniques to accelerate the FAMOUS/HadCM3 climate model with a focus on the radiative transfer algorithm
Fancier: A Unified Framework for Java, C, and OpenCL Integration
FANN-on-MCU: An Open-Source Toolkit for Energy-Efficient Neural Network Inference at the Edge of the Internet of Things
FANS: FPGA-Accelerated Near-Storage Sorting
FARGO3D: A new GPU-oriented MHD code
Fast 2-D Ultrasound Strain Imaging: The Benefits of Using a GPU
Fast 2D-3D registration using GPU-based preprocessing
Fast 3D Graphics Rendering Technique with CUDA Parallel Processing
Fast 3D Salient Region Detection in Medical Images using GPUs
Fast 3D Structure Localization in Medical Volumes using CUDA-enabled GPUs
Fast 3D Wavelet Transform on Multicore and Manycore Computing Platforms
Fast 4D Sheared Filtering for Interactive Rendering of Distribution Effects
Fast 4pi track reconstruction in nuclear emulsion detectors based on GPU technology
Fast Acceleration of 2D Wave Propagation Simulations Using Modern Computational Accelerators
Fast acoustic computations using graphics processors
Fast Adaptive Sampling Technique for Multi-Dimensional Integral Estimation Using GPUs
Fast algorithm of ray tracing based on KD-tree structure
Fast algorithms and efficient GPU implementations for the Radon transform and the back-projection operator represented as convolution operators
Fast Algorithms for Convolutional Neural Networks
Fast Algorithms for the Solution of Stochastic Partial Differential Equations
Fast American Basket Option Pricing on a multi-GPU Cluster
Fast Analysis of Molecular Dynamics Trajectories with Graphics Processing Units – Radial Distribution Function Histogramming
Fast analysis of molecular dynamics trajectories with graphics processing units-Radial distribution function histogramming
Fast analytical modeling of Compton scatter using point clouds and graphics processing unit (GPU)
Fast and accurate digital signal processing realized with GPGPU technology
Fast and Accurate Finite-Element Multigrid Solvers for PDE Simulations on GPU Clusters
Fast and Accurate Generalized Harmonic Analysis and Its Parallel Computation by GPU
Fast and accurate PIV computation using highly parallel iterative correlation maximization
Fast and Accurate Poisson Denoising with Optimized Nonlinear Diffusion
Fast and accurate protein substructure searching with simulated annealing and GPUs
Fast and approximate stream mining of quantiles and frequencies using graphics processors
Fast and automatic object pose estimation for range images on the GPU
Fast and Efficient Automatic Memory Management for GPUs using Compiler-Assisted Runtime Coherence Scheme
Fast and Efficient Dense Variational Stereo on GPU
Fast and Efficient FPGA-Based Feature Detection Employing the SURF Algorithm
Fast and Efficient Lossless Image Compression Based on CUDA Parallel Wavelet Tree Encoding
Fast and Energy-Efficient CNN Inference on IoT Devices
Fast and exact solution of Total Variation models on the GPU
Fast and Flexible GPU Accelerated Binding Free Energy Calculations within the AMBER Molecular Dynamics Package
Fast and Flexible: Parallel Packet Processing with GPUs and Click
Fast and informative flow simulations in a building by using fast fluid dynamics model on graphics processing unit
Fast and Maliciously Secure Two-Party Computation Using the GPU
Fast and Memory Efficient GPU-Based Rendering of Tensor Data
Fast and Memory-Efficient Minimum Spanning Tree on the GPU
Fast and Practical Strassen’s Matrix Multiplication using FPGAs
Fast and reliable collision culling using graphics hardware
Fast and Robust 3D Correspondence Matching and Its Application to Volume Registration
Fast and robust CAMShift tracking
Fast and Robust Linear Motion Deblurring
Fast and Robust Pyramid-based Image Processing
Fast and Scalable CPU/GPU Collision Detection for Rigid and Deformable Surfaces
Fast and scalable list ranking on the GPU
Titles: 100
open PDFs: 96
packages: 23