Papers on hgpu.org (.txt-file)
SneakySnake: A Fast and Accurate Universal Genome Pre-Alignment Filter for CPUs, GPUs, and FPGAs

Snowflake: A Lightweight Portable Stencil DSL

SnuCL: an OpenCL framework for heterogeneous CPU/GPU clusters

SnuHPL: high performance LINPACK for heterogeneous GPUs
SoaAlloc: Accelerating Single-Method Multiple-Objects Applications on GPUs

SOAP3-dp: Fast, Accurate and Sensitive GPU-based Short Read Aligner

SoAx: A generic C++ Structure of Arrays for handling Particles in HPC Codes

SOCL: An OpenCL Implementation with Automatic Multi-Device Adaptation Support

SODECL: An Open Source Library for Calculating Multiple Orbits of a System of Stochastic Differential Equations in Parallel

SOFF: An OpenCL High-Level Synthesis Framework for FPGAs

Soft Error Resilient QR Factorization for Hybrid System

Soft Error Resilient QR Factorization for Hybrid System with GPGPU

Soft GPGPUs for Embedded FPGAs: An Architectural Evaluation

Softshell: Dynamic Scheduling on GPUs

Software Challenges for Extreme Scale Computing: Going From Petascale to Exascale Systems

Software Compilation Techniques for Heterogeneous Embedded Multi-Core Systems

Software Defined Radio over CUDA

Software Development Tools Using GPGPU Potentialities

Software Model Checking for GPGPU Programs, Towards a Verification Tool

Software Optimization and Orchestration for Heterogeneous and Distributed Architectures

Software parallel CAVLC encoder based on stream processing

Software Performance Analysis with Parallel Programming Approaches

Software Pipelined Execution of Stream Programs on GPUs

Software Platform for Hybrid Resource Management of Many-core Accelerators

Software Polarization Spectrometer "PolariS"

Software Prefetching for Indirect Memory Accesses

Software Reliability Enhancements for GPU Applications

Software Testing – Test Suite Compilation and Execution Optimizations

Software-Based Algorithm for Modeling and Correction of Gradient Nonlinearity Distortions in Magnetic Resonance Imaging

Software-based branch predication for AMD GPUs
Software-Based Hardening Strategies for Neutron Sensitive FFT Algorithms on GPUs

Software-Defined FPGA Accelerator Design for Mobile Deep Learning Applications

SoK: A Systems Perspective on Compound AI Threats and Countermeasures

SOL: Effortless Device Support for AI Frameworks without Source Code Changes

SOL: Reducing the Maintenance Overhead for Integrating Hardware Support into AI Frameworks

Solution Level Parallelization of Local Search Metaheuristic Algorithm on GPU

Solutions for Optimizing the Monte Carlo Option Pricing Method’s Implementation Using the Compute Unified Device Architecture

Solutions For Optimizing The Radix Sort Algorithmic Function Using The Compute Unified Device Architecture

Solver for Systems of Linear Equations with Infinite Precision on a GPU Cluster

Solving $k$-Nearest Vector Problem on Multiple Graphics Processors

Solving 3D Anisotropic Elastic Wave Equations on Parallel GPU Devices

Solving 3D incompressible Navier-Stokes equations on hybrid CPU/GPU systems

Solving 3D viscous incompressible Navier-Stokes equations using CUDA

Solving a kind of BVP for ODEs on heterogeneous CPU + CUDA-enabled GPU systems

Solving Batched Linear Programs on GPU and Multicore CPU

Solving Bivariate Polynomial Systems on a GPU

Solving convex optimization problems on FPGA using OpenCL

Solving Dense Generalized Eigenproblems on Multi-threaded Architectures

Solving Dense Linear Systems on Graphics Processors

Solving dense linear systems on platforms with multiple hardware accelerators

Solving diffractive optics problems using graphics processing units
Solving Discrete Logarithms in Smooth-Order Groups with CUDA

Solving incompressible Navier-Stokes equations on heterogeneous parallel architectures

Solving Incompressible Two-Phase Flows on Massively Parallel Multi-GPU Clusters

Solving incompressible two-phase flows on multi-GPU clusters

Solving Kinetic Equations on GPUs I: Model Kinetic Equations

Solving knapsack problems on GPU
Solving large permutation flow-shop scheduling problems on GPU-accelerated supercomputers

Solving Large Regression Problems using an Ensemble of GPU-accelerated ELMs

Solving lattice QCD systems of equations using mixed precision solvers on GPUs

Solving Linear Equations with Conjugate Gradient Method on OpenCL Platforms

Solving Linear Recurrences on Hybrid GPU Accelerated Manycore Systems

Solving MaxSAT with Matrix Multiplication

Solving Mixed Integer Programs Using Neural Networks

Solving Molecular Distance Geometry Problems in OpenCL

Solving Multiple Queries through a Permutation Index in GPU

Solving Parabolic Problems Using Multithread and GPU
Solving Path Problems on the GPU

Solving prime-field ECDLPs on GPUs with OpenCL

Solving quadratic assignment problems by genetic algorithms with GPU computation: a case study

Solving Quadratic Programming Problems on Graphics Processing Unit

Solving RFIC Simulation Tasks Using GPU Computations

Solving Rigid Multibody Physics Dynamics Using Proximal Point Functions on the GPU

Solving Sparse Linear Systems on NVIDIA Tesla GPUs
Solving Stochastic Differential Equations Using General Purpose Graphics Processing Unit

Solving Systems of Polynomial Equations on a GPU

Solving the Boltzmann Equation on GPU

Solving the Boltzmann equation on GPUs

Solving the Caputo Fractional Reaction-Diffusion Equation on GPU

Solving the Coalition Structure Generation Problem on a GPU

Solving the Euler Equations on Graphics Processing Units

Solving the Examination Timetabling Problem in GPUs

Solving the Flexible Job Shop Problem on Multi-GPU

Solving the Ghost-Gluon System of Yang-Mills Theory on GPUs

Solving the Quadratic Assignment Problem on heterogeneous environment (CPUs and GPUs) with the application of Level 2 Reformulation and Linearization Technique

Solving the Vlasov equation for one-dimensional models with long range interactions on a GPU

Solving very large instances of the scheduling of independent tasks problem on the GPU

Solving Wave Equations on Unstructured Geometries

Some examples of instant computations of fluid dynamics on GPU

Some Graph Algorithms And Related Primitives For The GPU

Some of the What?, Why?, How?, Who? and Where? of Graphics Processing Unit Computing for Bayesian Analysis

SOMGPU: An unsupervised pattern classifier on Graphical Processing Unit

Somoclu: An Efficient Distributed Library for Self-Organizing Maps

Sop-GPU: Accelerating biomolecular simulations in the centisecond timescale using graphics processors

Soren: Adaptive MapReduce for Programmable GPUs

Sort-First Parallel Volume Rendering

Titles: 100
open PDFs: 92
packages: 20
