Papers on hgpu.org (.txt-file)

Extending a Run-time Resource Management framework to support OpenCL and Heterogeneous Systems

Extending abstract GPU APIs to shared memory

Extending adaptive sparse grids for stochastic collocation to hybrid parallel architectures

Extending High-Level Synthesis for Task-Parallel Programs

Extending Lyapack for the Solution of Band Lyapunov Equations on Hybrid CPU-GPU Platforms

Extending MAGMA Portability with OneAPI

Extending OmpSs for OpenCL kernel co-execution in heterogeneous systems

Extending OmpSs to support CUDA and OpenCL in C, C++ and Fortran Applications

Extending Scala with General Purpose GPU Programming

Extending SYCL’s Programming Paradigm with Tensor-based SIMD Abstractions

Extending the Computational Application of Reaction-Diffusion Chemistry by Modelling Artificial Neural Networks

Extending the Generalized Fermat Prime Number Search Beyond One Million Digits Using GPUs

Extending the Gotran framework: LATEX and GPU acceleration

Extending the Scalability of Single Chip Stream Processors with On-chip Caches

Extending the SkelCL Skeleton Library for Stencil Computations on Multi-GPU Systems

Extension of the SkePU Skeleton Programming Framework for Multi-core CPU and Multi-GPU Systems for MPI-based Clusters

Extensions and Limitations of the Neural GPU

Extensions of Parallel Coordinates for Interactive Exploration of Large Multi-Timepoint Data Sets

Extinction-Based Shading and Illumination in GPU Volume Ray-Casting

Extracting Flow Features Using Bag-of-Features and Supervised Learning Techniques

Extracting Maximal Exact Matches on GPU

Extremely fast simulator for decoding LDPC codes

Extremely large scale simulation of a Kardar-Parisi-Zhang model using graphics cards

Eye-Full Tower: A GPU-based variable multibaseline omnidirectional stereovision system with automatic baseline selection for outdoor mobile robot navigation

Face Detection CUDA Accelerating

Face Detection for Human Identification in Surveillance

Face Detection with Improved Local Binary Patterns in CUDA

Face Recognition with Hybrid Efficient Convolution Algorithms on FPGAs

Face Recognition: A Tutorial on Computational Aspects

Face Retriever: Pre-filtering the Gallery via Deep Neural Net

Face Search at Scale: 80 Million Gallery

Face.evoLVe: A High-Performance Face Recognition Library

Facial Expression Recognition – Review

Facial Recognition Using Neural Networks over GPGPU

fairseq: A Fast, Extensible Toolkit for Sequence Modeling

Falcon: A Graph Manipulation Language for Heterogeneous Systems

FAMOUS, faster: using parallel computing techniques to accelerate the FAMOUS/HadCM3 climate model with a focus on the radiative transfer algorithm

Fancier: A Unified Framework for Java, C, and OpenCL Integration

FANN-on-MCU: An Open-Source Toolkit for Energy-Efficient Neural Network Inference at the Edge of the Internet of Things

FANS: FPGA-Accelerated Near-Storage Sorting

FARGO3D: A new GPU-oriented MHD code

Fast 2-D Ultrasound Strain Imaging: The Benefits of Using a GPU

Fast 2D-3D registration using GPU-based preprocessing

Fast 3D Graphics Rendering Technique with CUDA Parallel Processing

Fast 3D Salient Region Detection in Medical Images using GPUs

Fast 3D Structure Localization in Medical Volumes using CUDA-enabled GPUs

Fast 3D Wavelet Transform on Multicore and Manycore Computing Platforms

Fast 4D Sheared Filtering for Interactive Rendering of Distribution Effects

Fast 4pi track reconstruction in nuclear emulsion detectors based on GPU technology

Fast Acceleration of 2D Wave Propagation Simulations Using Modern Computational Accelerators

Fast acoustic computations using graphics processors

Fast Adaptive Sampling Technique for Multi-Dimensional Integral Estimation Using GPUs

Fast algorithm of ray tracing based on KD-tree structure

Fast algorithms and efficient GPU implementations for the Radon transform and the back-projection operator represented as convolution operators

Fast Algorithms for Convolutional Neural Networks

Fast Algorithms for the Solution of Stochastic Partial Differential Equations

Fast American Basket Option Pricing on a multi-GPU Cluster

Fast Analysis of Molecular Dynamics Trajectories with Graphics Processing Units – Radial Distribution Function Histogramming

Fast analysis of molecular dynamics trajectories with graphics processing units-Radial distribution function histogramming

Fast analytical modeling of Compton scatter using point clouds and graphics processing unit (GPU)

Fast and accurate digital signal processing realized with GPGPU technology

Fast and Accurate Finite-Element Multigrid Solvers for PDE Simulations on GPU Clusters

Fast and Accurate Generalized Harmonic Analysis and Its Parallel Computation by GPU

Fast and accurate PIV computation using highly parallel iterative correlation maximization

Fast and Accurate Poisson Denoising with Optimized Nonlinear Diffusion

Fast and accurate protein substructure searching with simulated annealing and GPUs

Fast and approximate stream mining of quantiles and frequencies using graphics processors

Fast and automatic object pose estimation for range images on the GPU

Fast and Efficient Automatic Memory Management for GPUs using Compiler-Assisted Runtime Coherence Scheme

Fast and Efficient Dense Variational Stereo on GPU

Fast and Efficient FPGA-Based Feature Detection Employing the SURF Algorithm

Fast and Efficient Lossless Image Compression Based on CUDA Parallel Wavelet Tree Encoding

Fast and Energy-Efficient CNN Inference on IoT Devices

Fast and exact solution of Total Variation models on the GPU

Fast and Flexible GPU Accelerated Binding Free Energy Calculations within the AMBER Molecular Dynamics Package

Fast and Flexible: Parallel Packet Processing with GPUs and Click

Fast and informative flow simulations in a building by using fast fluid dynamics model on graphics processing unit

Fast and Maliciously Secure Two-Party Computation Using the GPU

Fast and Memory Efficient GPU-Based Rendering of Tensor Data

Fast and Memory-Efficient Minimum Spanning Tree on the GPU

Fast and Practical Strassen’s Matrix Multiplication using FPGAs

Fast and reliable collision culling using graphics hardware

Fast and Robust 3D Correspondence Matching and Its Application to Volume Registration

Fast and robust CAMShift tracking

Fast and Robust Linear Motion Deblurring

Fast and Robust Pyramid-based Image Processing

Fast and Scalable CPU/GPU Collision Detection for Rigid and Deformable Surfaces

Fast and scalable list ranking on the GPU

Fast and sleek glyph rendering for interactive HARDI data exploration

Fast Antenna Characterization Using the Sources Reconstruction Method on Graphics Processors

Fast approximate k-nearest neighbours search using GPGPU

Fast Approximation of High-Order Voronoi Diagrams and Distance Transforms on the GPU

Fast Arbitrary Precision Floating Point on FPGA

Fast Automatic Heuristic Construction Using Active Learning

Fast binding site mapping using GPUs and CUDA

Fast Bio-Inspired Computation using a GPU-based Systemic Computer

Fast Boolean Calculations Using the GPU

Titles: 100
open PDFs: 94
packages: 21
