Papers on hgpu.org (.txt-file)
Speeding up the MATLAB Hyperspectral Image Analysis Toolbox using GPUs and the Jacket Toolbox

Speeding up the small progress measures algorithm for parity games using the GPU

Speeding-up Pearson Correlation Coefficient calculation on graphical processing units

Speeding-up the Verification Phase of Set Similarity Joins in the GPGPU paradigm

Speedup and Parallelization Models for Energy-Efficient Many-Core Systems Using Performance Counters

Speedup for quantum optimal control from GPU-based automatic differentiation

Speedup of Fuzzy Clustering Through Stream Processing on Graphics Processing Units

Speedup of Micromagnetic Simulations with C++ AMP On Graphics Processing Units

Speedup of Type-1 Fuzzy Logic Systems on Graphics Processing Units Using CUDA

Speedups between x70 and x120 for a generic local search (memetic) algorithm on a single GPGPU chip

sPEGG: high throughput eco-evolutionary simulations on commodity graphics processors

SPH Based Fluid Animation Using CUDA Enabled GPU

SPH Fluids for Viscous Jet Buckling

Spherical harmonic transform on heterogeneous architectures using hybrid programming

Spherical harmonic transform with GPUs

Spiking Neural Networks for Real-Time Infrared Images Processing in Thermo Vision Systems

SPIRE, a Sequential to Parallel Intermediate Representation Extension

Split tiling for GPUs: automatic parallelization using trapezoidal tiles

Splotch: porting and optimizing for the Xeon Phi

SpMV: A Memory-Bound Application on the GPU Stuck Between a Rock and a Hard Place

SPOC: GPGPU Programming Through Stream Processing With OCaml

Sponge: portable stream programming on graphics engines

Spotting Radio Transients with the help of GPUs

SPRAT: Runtime processor selection for energy-aware computing

Spring-Bead Animation of Viscoelastic Materials

Springald: GPU-Accelerated Window-Based Aggregates Over Out-of-Order Data Streams

Spyx: A Library for Just-In-Time Compiled Optimization of Spiking Neural Networks

SqueezCL: Squeezing OpenCL Kernels for Approximate Computing on Contemporary GPUs

SRAM-DRAM hybrid memory with applications to efficient register files in fine-grained multi-threading

SRP Based Natural Interaction between Real and Virtual Worlds in Augmented Reality

SSLPV: subsurface light propagation volumes

SSLShader: Cheap SSL Acceleration with Commodity Processors

Stability and Performance of Various Singular Value QR Implementations on Multicore CPU with a GPU

Stabilized Backward Diffusion for Partial Volume Correction

Stable large-scale solver for Ginzburg-Landau equations for superconductors

Stack-less SIMT reconvergence at low cost

Stackless KD-Tree Traversal for High Performance GPU Ray Tracing

Stadium Hashing: Scalable and Flexible Hashing on GPUs

Staggered fermions simulations on GPUs

STAR-RT: Visual attention for real-time video game playing

Starchart: Hardware and Software Optimization Using Recursive Partitioning Regression Trees

Stargazer: Automated Regression-Based GPU Design Space Exploration

STARK: Strategic Team of Agents for Refining Kernels

StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators

StarPU: a Runtime System for Scheduling Tasks over Accelerator-Based Multicore Machines

StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures

State Lattice-based Motion Planning for Autonomous On-Road Driving

State of The Art Report on GPU

State of the Art Report on Real-time Rendering with Hardware Tessellation

State-Based Gauss-Seidel Framework for Real-time 2D Ultrasound Image Sequence Denoising on GPUs

State-of-the-art in heterogeneous computing

Stateful Dataflow Multigraphs: A Data-Centric Model for High-Performance Parallel Programs

Static Analysis and Dynamic Adaptation of Parallelism

Static and Dynamic Analyses for Efficient GPU Execution

Static Compilation Analysis for Host-Accelerator Communication Optimization

Static GPU threads and an improved scan algorithm

Static Memory Access Pattern Analysis on a Massively Parallel GPU

Statistical Computing With Graphics Processing Units

Statistical constraints on binary black hole inspiral dynamics

Statistical Power Consumption Analysis and Modeling for GPU-based Computing

Statistical power modeling of GPU kernels using performance counters

Statistical testing of random number sequences using CUDA

stdgpu: Efficient STL-like Data Structures on the GPU

Stealing Webpages Rendered on Your Browser by Exploiting GPU Vulnerabilities

Stellar Mergers with HPX-Kokkos and SYCL: Methods of using an Asynchronous Many-Task Runtime System with SYCL

Stellar-mass black holes in star clusters: implications for gravitational wave radiation

Stencil and Lattice Structures for Field Equation Model Simulations on GPUs

Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures

Stencil Computations on AMD and Nvidia Graphics Processors: Performance and Tuning Strategies

Stencil shadow volumes for complex and deformable objects

Stencil-Aware GPU Optimization of Iterative Solvers

StencilFlow: Mapping Large Stencil Programs to Distributed Spatial Computing Systems

StePS: A Multi-GPU Cosmological N-body Code for Compactified Simulations

Stereo depth with a Unified Architecture GPU

Stereo Matching Algorithm Using Population-Based Incremental Learning on GPU

Stereo Matching using Multi-Resolution Images on CUDA

Stereoscopic Ray Tracing on Graphics Processors

Stereoscopic Scene Flow Computation for 3D Motion Understanding

StitchCUDA: An Automated Multi-Agents End-to-End GPU Programing Framework with Rubric-based Agentic Reinforcement Learning

Stochastic Analysis of a Queue Length Model Using a Graphics Processing Unit

Stochastic Differential Equations simulation using GPU

Stochastic DT-MRI Connectivity Mapping on the GPU

Stochastic Gradient Descent on GPUs

Stochastic Progressive Photon Mapping for Dynamic Scenes

STOCHSIMGPU: Parallel stochastic simulation for the Systems Biology Toolbox 2 for MATLAB

Stock trading strategy creation using GP on GPU

StoreGPU: exploiting graphics processing units to accelerate distributed storage systems

Strain Visualization of Ultra Sound Signals Processed by General Purpose Graphic Process Unit

Strassen’s Matrix Multiplication on GPUs

Strategies for Maximizing Utilization in multi-CPU & multi-GPU Heterogeneous Architectures

Strategies for Optimization of Parallel Programs

Strategies for preparing computer science students for the multicore world

Strategies for Protecting Intellectual Property when Using CUDA Applications on Graphics Processing Units

Strategies for the Heterogeneous Execution of Large-Scale Simulations on Hybrid Supercomputers

Strategies to minimise the total run time of cyclic graph based genetic programming with GPUs

Titles: 100
open PDFs: 94
packages: 22
