Papers on hgpu.org (.txt-file)

Sinus Endoscopy – Application of Advanced GPU Volume Rendering for Virtual Endoscopy

Six-fold speed-up of Smith-Waterman sequence database searches using parallel processing on common microprocessors

Size Matters: Space/Time Tradeoffs to Improve GPGPU Applications Performance

Size-based Transfer Functions: A New Volume Exploration Technique

Skeletal rigid skinning with blending patches on the GPU

Skeleton and Shape Adjustment and Tracking in Multicamera Environments

Skeleton Programming for Heterogeneous GPU-based Systems

Skeleton-based Automatic Parallelization of Image Processing Algorithms for GPUs

Skeleton-based edge bundling for graph visualization

SkePU 2: Flexible and Type-Safe Skeleton Programming for Heterogeneous Parallel Systems

SkePU 2: Language Embedding and Compiler Support for Flexible and Type-Safe Skeleton Programming

SkePU: a multi-backend skeleton programming library for multi-GPU systems

Sketch Based Facial Expression Recognition Using Graphics Hardware

Sketching MLS Image Deformations On the GPU

Skew Handling in Aggregate Streaming Queries on GPUs

Skinning with dual quaternions

SKMD: Single Kernel on Multiple Devices for Transparent CPU-GPU Collaboration

SkyFlow: Heterogeneous streaming for skyline computation using FlowGraph and SYCL

SLATE port to AMD and Intel platforms

Sliding-Tris: A Sliding Window Level-of-Detail Scheme

Sliding-Windows for Rapid Object Class Localization: A Parallel Technique

SMAA: Enhanced Subpixel Morphological Antialiasing

Small Discrete Fourier Transforms on GPUs

Small-Bench NLP: Benchmark for small single GPU trained models in Natural Language Processing

Small-ruleset regular expression matching on GPGPUs: quantitative performance analysis and optimization

Smart Multi-Task Scheduling for OpenCL Programs on CPU/GPU Heterogeneous Platforms

SMCGen: Generating Reconfigurable Design for Sequential Monte Carlo Applications

Smith-Waterman Acceleration in Multi-GPUs: A Performance per Watt Analysis

Smooth Mixed-Resolution GPU Volume Rendering

Smoothed Particle Hydrodynamics Simulation for Continuous Casting

Smoothed-Particle Hydrodynamics Models: Implementation Features on GPUs

SneakySnake: A Fast and Accurate Universal Genome Pre-Alignment Filter for CPUs, GPUs, and FPGAs

Snowflake: A Lightweight Portable Stencil DSL

SnuCL: an OpenCL framework for heterogeneous CPU/GPU clusters

SnuHPL: high performance LINPACK for heterogeneous GPUs

SoaAlloc: Accelerating Single-Method Multiple-Objects Applications on GPUs

SOAP3-dp: Fast, Accurate and Sensitive GPU-based Short Read Aligner

SoAx: A generic C++ Structure of Arrays for handling Particles in HPC Codes

SOCL: An OpenCL Implementation with Automatic Multi-Device Adaptation Support

SODECL: An Open Source Library for Calculating Multiple Orbits of a System of Stochastic Differential Equations in Parallel

SOFF: An OpenCL High-Level Synthesis Framework for FPGAs

Soft Error Resilient QR Factorization for Hybrid System

Soft Error Resilient QR Factorization for Hybrid System with GPGPU

Soft GPGPUs for Embedded FPGAs: An Architectural Evaluation

Softshell: Dynamic Scheduling on GPUs

Software Challenges for Extreme Scale Computing: Going From Petascale to Exascale Systems

Software Compilation Techniques for Heterogeneous Embedded Multi-Core Systems

Software Defined Radio over CUDA

Software Development Tools Using GPGPU Potentialities

Software Model Checking for GPGPU Programs, Towards a Verification Tool

Software Optimization and Orchestration for Heterogeneous and Distributed Architectures

Software parallel CAVLC encoder based on stream processing

Software Performance Analysis with Parallel Programming Approaches

Software Pipelined Execution of Stream Programs on GPUs

Software Platform for Hybrid Resource Management of Many-core Accelerators

Software Polarization Spectrometer "PolariS"

Software Prefetching for Indirect Memory Accesses

Software Reliability Enhancements for GPU Applications

Software Testing – Test Suite Compilation and Execution Optimizations

Software-Based Algorithm for Modeling and Correction of Gradient Nonlinearity Distortions in Magnetic Resonance Imaging

Software-based branch predication for AMD GPUs

Software-Based Hardening Strategies for Neutron Sensitive FFT Algorithms on GPUs

Software-Defined FPGA Accelerator Design for Mobile Deep Learning Applications

SoK: A Systems Perspective on Compound AI Threats and Countermeasures

SOL: Effortless Device Support for AI Frameworks without Source Code Changes

SOL: Reducing the Maintenance Overhead for Integrating Hardware Support into AI Frameworks

Solution Level Parallelization of Local Search Metaheuristic Algorithm on GPU

Solutions for Optimizing the Monte Carlo Option Pricing Method’s Implementation Using the Compute Unified Device Architecture

Solutions For Optimizing The Radix Sort Algorithmic Function Using The Compute Unified Device Architecture

Solver for Systems of Linear Equations with Infinite Precision on a GPU Cluster

Solving $k$-Nearest Vector Problem on Multiple Graphics Processors

Solving 3D Anisotropic Elastic Wave Equations on Parallel GPU Devices

Solving 3D incompressible Navier-Stokes equations on hybrid CPU/GPU systems

Solving 3D viscous incompressible Navier-Stokes equations using CUDA

Solving a kind of BVP for ODEs on heterogeneous CPU + CUDA-enabled GPU systems

Solving Batched Linear Programs on GPU and Multicore CPU

Solving Bivariate Polynomial Systems on a GPU

Solving convex optimization problems on FPGA using OpenCL

Solving Dense Generalized Eigenproblems on Multi-threaded Architectures

Solving Dense Linear Systems on Graphics Processors

Solving dense linear systems on platforms with multiple hardware accelerators

Solving diffractive optics problems using graphics processing units

Solving Discrete Logarithms in Smooth-Order Groups with CUDA

Solving incompressible Navier-Stokes equations on heterogeneous parallel architectures

Solving Incompressible Two-Phase Flows on Massively Parallel Multi-GPU Clusters

Solving incompressible two-phase flows on multi-GPU clusters

Solving Kinetic Equations on GPUs I: Model Kinetic Equations

Solving knapsack problems on GPU

Solving large permutation flow-shop scheduling problems on GPU-accelerated supercomputers

Solving Large Regression Problems using an Ensemble of GPU-accelerated ELMs

Solving lattice QCD systems of equations using mixed precision solvers on GPUs

Solving Linear Equations with Conjugate Gradient Method on OpenCL Platforms

Solving Linear Recurrences on Hybrid GPU Accelerated Manycore Systems

Solving MaxSAT with Matrix Multiplication

Solving Mixed Integer Programs Using Neural Networks

Solving Molecular Distance Geometry Problems in OpenCL

Titles: 100
open PDFs: 92
packages: 22
