Papers on hgpu.org (.txt-file)
Speeding up the small progress measures algorithm for parity games using the GPU
Speeding-up Pearson Correlation Coefficient calculation on graphical processing units
Speeding-up the Verification Phase of Set Similarity Joins in the GPGPU paradigm
Speedup and Parallelization Models for Energy-Efficient Many-Core Systems Using Performance Counters
Speedup for quantum optimal control from GPU-based automatic differentiation
Speedup of Fuzzy Clustering Through Stream Processing on Graphics Processing Units
Speedup of Micromagnetic Simulations with C++ AMP On Graphics Processing Units
Speedup of Type-1 Fuzzy Logic Systems on Graphics Processing Units Using CUDA
Speedups between x70 and x120 for a generic local search (memetic) algorithm on a single GPGPU chip
sPEGG: high throughput eco-evolutionary simulations on commodity graphics processors
SPH Based Fluid Animation Using CUDA Enabled GPU
SPH Fluids for Viscous Jet Buckling
Spherical harmonic transform on heterogeneous architectures using hybrid programming
Spherical harmonic transform with GPUs
Spiking Neural Networks for Real-Time Infrared Images Processing in Thermo Vision Systems
SPIRE, a Sequential to Parallel Intermediate Representation Extension
Split tiling for GPUs: automatic parallelization using trapezoidal tiles
Splotch: porting and optimizing for the Xeon Phi
SpMV: A Memory-Bound Application on the GPU Stuck Between a Rock and a Hard Place
SPOC: GPGPU Programming Through Stream Processing With OCaml
Sponge: portable stream programming on graphics engines
Spotting Radio Transients with the help of GPUs
SPRAT: Runtime processor selection for energy-aware computing
Spring-Bead Animation of Viscoelastic Materials
Spyx: A Library for Just-In-Time Compiled Optimization of Spiking Neural Networks
SqueezCL: Squeezing OpenCL Kernels for Approximate Computing on Contemporary GPUs
SRAM-DRAM hybrid memory with applications to efficient register files in fine-grained multi-threading
SRP Based Natural Interaction between Real and Virtual Worlds in Augmented Reality
SSLPV: subsurface light propagation volumes
SSLShader: Cheap SSL Acceleration with Commodity Processors
Stability and Performance of Various Singular Value QR Implementations on Multicore CPU with a GPU
Stabilized Backward Diffusion for Partial Volume Correction
Stable large-scale solver for Ginzburg-Landau equations for superconductors
Stack-less SIMT reconvergence at low cost
Stackless KD-Tree Traversal for High Performance GPU Ray Tracing
Stadium Hashing: Scalable and Flexible Hashing on GPUs
Staggered fermions simulations on GPUs
STAR-RT: Visual attention for real-time video game playing
Starchart: Hardware and Software Optimization Using Recursive Partitioning Regression Trees
Stargazer: Automated Regression-Based GPU Design Space Exploration
StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators
StarPU: a Runtime System for Scheduling Tasks over Accelerator-Based Multicore Machines
StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures
State Lattice-based Motion Planning for Autonomous On-Road Driving
State of The Art Report on GPU
State of the Art Report on Real-time Rendering with Hardware Tessellation
State-Based Gauss-Seidel Framework for Real-time 2D Ultrasound Image Sequence Denoising on GPUs
State-of-the-art in heterogeneous computing
Stateful Dataflow Multigraphs: A Data-Centric Model for High-Performance Parallel Programs
Static Analysis and Dynamic Adaptation of Parallelism
Static and Dynamic Analyses for Efficient GPU Execution
Static Compilation Analysis for Host-Accelerator Communication Optimization
Static GPU threads and an improved scan algorithm
Static Memory Access Pattern Analysis on a Massively Parallel GPU
Statistical Computing With Graphics Processing Units
Statistical constraints on binary black hole inspiral dynamics
Statistical Power Consumption Analysis and Modeling for GPU-based Computing
Statistical power modeling of GPU kernels using performance counters
Statistical testing of random number sequences using CUDA
stdgpu: Efficient STL-like Data Structures on the GPU
Stealing Webpages Rendered on Your Browser by Exploiting GPU Vulnerabilities
Stellar Mergers with HPX-Kokkos and SYCL: Methods of using an Asynchronous Many-Task Runtime System with SYCL
Stellar-mass black holes in star clusters: implications for gravitational wave radiation
Stencil and Lattice Structures for Field Equation Model Simulations on GPUs
Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures
Stencil shadow volumes for complex and deformable objects
Stencil-Aware GPU Optimization of Iterative Solvers
StencilFlow: Mapping Large Stencil Programs to Distributed Spatial Computing Systems
StePS: A Multi-GPU Cosmological N-body Code for Compactified Simulations
Stereo depth with a Unified Architecture GPU
Stereo Matching Algorithm Using Population-Based Incremental Learning on GPU
Stereo Matching using Multi-Resolution Images on CUDA
Stereoscopic Ray Tracing on Graphics Processors
Stereoscopic Scene Flow Computation for 3D Motion Understanding
Stochastic Analysis of a Queue Length Model Using a Graphics Processing Unit
Stochastic Differential Equations simulation using GPU
Stochastic DT-MRI Connectivity Mapping on the GPU
Stochastic Gradient Descent on GPUs
Stochastic Progressive Photon Mapping for Dynamic Scenes
STOCHSIMGPU: Parallel stochastic simulation for the Systems Biology Toolbox 2 for MATLAB
Stock trading strategy creation using GP on GPU
StoreGPU: exploiting graphics processing units to accelerate distributed storage systems
Strain Visualization of Ultra Sound Signals Processed by General Purpose Graphic Process Unit
Strassen’s Matrix Multiplication on GPUs
Strategies for Maximizing Utilization in multi-CPU & multi-GPU Heterogeneous Architectures
Strategies for Optimization of Parallel Programs
Strategies for preparing computer science students for the multicore world
Strategies for Protecting Intellectual Property when Using CUDA Applications on Graphics Processing Units
Strategies for the Heterogeneous Execution of Large-Scale Simulations on Hybrid Supercomputers
Strategies to minimise the total run time of cyclic graph based genetic programming with GPUs
Strategy Preserving Compilation for Parallel Functional Code
Stream computing on graphics hardware
Stream Join Processing on Heterogeneous Processors
Stream processing for fast and efficient rotated Haar-like features using rotated integral images
Stream Processing of Integral Images for Real-Time Object Detection
Titles: 100
open PDFs: 95
packages: 20