Papers on hgpu.org (.txt-file)
Sparselet Models for Efficient Multiclass Object Detection

Sparser, Better, Faster GPU Parsing

Spatial Data Structures, Sorting and GPU Parallelism for Situated-agent Simulation and Visualisation

Spatial Indexing of Large-Scale Geo-Referenced Point Data on GPGPUs Using Parallel Primitives

Spatial interpolation in massively parallel computing environments

Spatial interpolation of scattered geoscientific data

Spatial Join with R-Tree on Graphics Processing Units

Spatial Sorting Algorithms for Parallel Computing in Networks

Spatial splits in bounding volume hierarchies

Spatial: A Language and Compiler for Application Accelerators

Spatio-temporal upsampling on the GPU

Spatter: A Benchmark Suite for Evaluating Sparse Access Patterns

Special Relativistic Visualization by Local Ray Tracing

Specification and verification of GPGPU programs

Specification and Verification of GPGPU Programs using Permission-Based Separation Logic

Speckle Reduction with Trained Nonlinear Diffusion Filtering

Spectral classification using convolutional neural networks

Spectral Ewald Acceleration of Stokesian Dynamics for polydisperse suspensions

Spectral Method Characterization on FPGA and GPU Accelerators

Spectral volume rendering using GPU-based raycasting

Specular Effects on the GPU: State of the Art

Speculative Execution of Parallel Programs with Precise Exception Semantics on GPUs

Speculative Execution on GPU: An Exploratory Study
Speculative Execution on Multi-GPU Systems

Speculative Parallel Evaluation Of Classification Trees On GPGPU Compute Engines

Speculative Parallelization on GPGPUs

Speculative Segmented Sum for Sparse Matrix-Vector Multiplication on Heterogeneous Processors

Specx: a C++ task-based runtime system for heterogeneous distributed architectures

Speech Recognition on Modern Graphic Processing Units

Speech Recognition on Multi-Core Processors and GPUs

Speed and Portability issues for Random Number Generation on Graphical Processing Units with CUDA and other Processing Accelerators

Speed sign detection and recognition by convolutional neural networks

Speed up Large Integer Multiplication Using Fourier Transforms and CUDA Technology

Speed-Up Improvement Using Parallel Approach in Image Steganography

Speed, power and cost implications for GPU acceleration of Computational Fluid Dynamics on HPC systems

Speeding up a few orders of magnitude the Jacobi method: high order Chebyshev-Jacobi over GPUs

Speeding up a Video Summarization Approach Using GPUs and Multicore CPUs

Speeding up Automatic Hyperparameter Optimization of Deep Neural Networks by Extrapolation of Learning Curves

Speeding Up Computer Vision Applications on Mobile Computing Platforms

Speeding Up Cycle Based Logic Simulation Using Graphics Processing Units
Speeding Up Geospatial Polygon Rasterization on GPGPUs

Speeding Up Homomorpic Hashing Using GPUs
Speeding up K-Means Algorithm by GPUs
Speeding up Large-Scale Point-in-Polygon Test Based Spatial Join on GPUs

Speeding up lattice sieve with Xeon Phi coprocessor

Speeding up LIP-Canny with CUDA programming

Speeding Up Model Building for ECGA on CUDA Platform

Speeding up Mutual Information Computation Using NVIDIA CUDA Hardware

Speeding Up Object Detection: Fast Resizing in the Integral Image Domain

Speeding Up Particle Trajectory Simulations under Moving Force Fields using GPUs

Speeding Up Reinforcement Learning with Graphics Processing Units

Speeding up Scoring Module of Mass Spectrometry Based Protein Identification by GPU

Speeding up subset seed algorithm for intensive protein sequence comparison

Speeding up the evaluation of evolutionary learning systems using GPGPUs

Speeding up the evaluation phase of GP classification algorithms on GPUs

Speeding up the MATLAB complex networks package using graphic processors

Speeding up the MATLAB Hyperspectral Image Analysis Toolbox using GPUs and the Jacket Toolbox
Speeding up the small progress measures algorithm for parity games using the GPU

Speeding-up Pearson Correlation Coefficient calculation on graphical processing units

Speeding-up the Verification Phase of Set Similarity Joins in the GPGPU paradigm

Speedup and Parallelization Models for Energy-Efficient Many-Core Systems Using Performance Counters

Speedup for quantum optimal control from GPU-based automatic differentiation

Speedup of Fuzzy Clustering Through Stream Processing on Graphics Processing Units

Speedup of Micromagnetic Simulations with C++ AMP On Graphics Processing Units

Speedup of Type-1 Fuzzy Logic Systems on Graphics Processing Units Using CUDA

Speedups between x70 and x120 for a generic local search (memetic) algorithm on a single GPGPU chip

sPEGG: high throughput eco-evolutionary simulations on commodity graphics processors

SPH Based Fluid Animation Using CUDA Enabled GPU

SPH Fluids for Viscous Jet Buckling

Spherical harmonic transform on heterogeneous architectures using hybrid programming

Spherical harmonic transform with GPUs

Spiking Neural Networks for Real-Time Infrared Images Processing in Thermo Vision Systems

SPIRE, a Sequential to Parallel Intermediate Representation Extension

Split tiling for GPUs: automatic parallelization using trapezoidal tiles

Splotch: porting and optimizing for the Xeon Phi

SpMV: A Memory-Bound Application on the GPU Stuck Between a Rock and a Hard Place

SPOC: GPGPU Programming Through Stream Processing With OCaml

Sponge: portable stream programming on graphics engines

Spotting Radio Transients with the help of GPUs

SPRAT: Runtime processor selection for energy-aware computing

Spring-Bead Animation of Viscoelastic Materials

Springald: GPU-Accelerated Window-Based Aggregates Over Out-of-Order Data Streams

Spyx: A Library for Just-In-Time Compiled Optimization of Spiking Neural Networks

SqueezCL: Squeezing OpenCL Kernels for Approximate Computing on Contemporary GPUs

SRAM-DRAM hybrid memory with applications to efficient register files in fine-grained multi-threading

SRP Based Natural Interaction between Real and Virtual Worlds in Augmented Reality
SSLPV: subsurface light propagation volumes

SSLShader: Cheap SSL Acceleration with Commodity Processors

Stability and Performance of Various Singular Value QR Implementations on Multicore CPU with a GPU

Stabilized Backward Diffusion for Partial Volume Correction

Stable large-scale solver for Ginzburg-Landau equations for superconductors

Stack-less SIMT reconvergence at low cost

Stackless KD-Tree Traversal for High Performance GPU Ray Tracing

Stadium Hashing: Scalable and Flexible Hashing on GPUs

Staggered fermions simulations on GPUs

STAR-RT: Visual attention for real-time video game playing

Titles: 100
open PDFs: 93
packages: 20
