Papers on hgpu.org (.txt-file)
Scattering Parameters and Surface Normals from Homogeneous Translucent Materials using Photometric Stereo

Scattering Points in Parallel Coordinates

Scene Boundary Detection Technique Based on Bottom-Up Attention System and OpenCL Parallel Implementation

Scene image classfying via the Partially Connected Neural Network
Scene independent real-time indirect illumination

Scene Recognition Acceleration Using CUDA and OpenMP
SCF: a device- and language-independent task coordination framework for reconfigurable, heterogeneous systems

SCGPSim: A fast SystemC simulator on GPUs

Scheduling (ir)regular applications on heterogeneous platforms

Scheduling a Parallel Sparse Direct Solver to Multiple GPUs

Scheduling by Work-Stealing in Hybrid Parallel Architectures

Scheduling Computation Graphs of Deep Learning Models on Manycore CPUs

Scheduling data flow program in xkaapi: A new affinity based Algorithm for Heterogeneous Architectures

Scheduling Dataflow Execution Across Multiple Accelerators

Scheduling Deep Learning Jobs in Multi-Tenant GPU Clusters via Wise Resource Sharing

Scheduling for new computing platforms with GPUs

Scheduling Languages: A Past, Present, and Future Taxonomy

Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources

Scheduling on Manycore and Heterogeneous Graphics Processors

Scheduling Parallel Tasks under Multiple Resources: List Scheduling vs. Pack Scheduling

Scheduling processing of real-time data streams on heterogeneous multi-GPU systems

Scheduling Tasks over Multicore machines enhanced with Accelerators: a Runtime System’s Perspective

SciAI4Industry – Solving PDEs for industry-scale problems with deep learning

Scientific and Engineering Computing Using ATI Stream Technology

Scientific computation for simulations on programmable graphics hardware

Scientific Computation on Graphics Processing Unit using CUDA

Scientific Computation Through a GPU
Scientific Computing on Heterogeneous Architectures

Scientific Computing on Hybrid Architectures

Scientific Computing Using Consumer Video-Gaming Hardware Devices

Scientific Computing with Python on GPUs

Scientific GPU Programming with Data-Flow Languages

Scientific Programming for Heterogeneous Systems – Bridging the Gap between Algorithms and Applications

Scientific Visualization in Astronomy: Towards the Petascale Astronomy Era

Scope for performance enhancement of CMU Sphinx by parallelising with OpenCL

Scope is all you need: Transforming LLMs for HPC Code

Scout: a data-parallel programming language for graphics processors
Seamless acceleration of Fortran intrinsics via AMD AI engines

Seamless Dynamic Runtime Reconfiguration in a Software-Defined Radio

Seamless GPU acceleration for C++ based physics with the Metal Shading Language on Apple’s M series unified chips

Searching CUDA code autotuning spaces with hardware performance counters: data from benchmarks running on various GPU architectures

Searching for a counterexample of Kurepa’s Conjecture

Searching for Concurrent Design Patterns in Video Games

Searching for sinks of Henon map using a multiple-precision GPU arithmetic library

Second Order Pre-Integrated Volume Rendering

Secret Key Cryptography Using Graphics Cards

Secure 3D graphics for virtual machines

Secure Distributed Computing on a Manycore Cloud

SecureMed: Secure Medical Computation using GPU-Accelerated Homomorphic Encryption Scheme

Securing GPU via Region-based Bounds Checking

Seeded ND medical image segmentation by cellular automaton on GPU

SeedFold: Scaling Biomolecular Structure Prediction

Seeing through the fog: an algorithm for fast and accurate touch detection in optical tabletop surfaces

Seer: Predictive Runtime Kernel Selection for Irregular Problems

Seismic Attributes Extraction Based on GPU

Seismic damage simulation for urban buildings based on high-performance GPU computing

Seismic imaging based on spectral differentiation matrix and GPU implementation

Seismic volume visualization for horizon extraction

Seismic Wave Propagation Simulation Using Accelerated Support Operator Rupture Dynamics on Multi-GPU

Seismic Wave Propagation Simulation Using Support Operator Method on multi-GPU system

Selecting the Best Tridiagonal System Solver Projected on Multi-Core CPU and GPU Platforms

Selection algorithm of graphic accelerators in heterogeneous cluster for optimization computing

Selection of Task Implementations in the Nanos++ Runtime

Self-Adapting Parallel Framework for Long-Term Object Tracking

Self-Adaptive Multiprecision Preconditioners on Multicore and Manycore Architectures

Self-calibration of geometric and radiometric parameters for cone-beam computed tomography

self-CD: Interactive Self-collision Detection for Deformable Body Simulation Using GPUs

Self-Configuring Applications for Heterogeneous Systems: Program Composition and Optimization Using Cognitive Techniques

Self-Supervised Clustering for Codebook Construction: An Application to Object Localization

Self-Tuning Distribution of DB-Operations on Hybrid CPU/GPU Platforms

Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

Semantic Pose using Deep Networks Trained on Synthetic RGB-D

Semantic Segmentation of Colon Glands with Deep Convolutional Neural Networks and Total Variation Segmentation

SemCache: Semantics-aware Caching for Efficient GPU Offloading

Semi-Analytic Solutions to the Radiative Transfer Equations via Hetergeneous Computing

Semi-Global Filtering of Airborne LiDAR Data for Fast Extraction of Digital Terrain Models

Semi-Global Matching-Motivation, Developments and Applications

Separable projection integrals for higher-order correlators of the cosmic microwave sky: Acceleration by factors exceeding 100

Separate Compilation in a Language-Integrated Heterogeneous Environment

Sequence alignment with GPU: Performance and design challenges

Sequence Data Indexing Method Exploiting the Parallel Processing Resources of GPGPU

Sequence Homology Search using Fine-Grained Cycle Sharing of Idle GPUs

Sequence Parallelism: Making 4D Parallelism Possible

Sequential Code Parallelization for Multi-core Embedded Systems: A Survey of Models, Algorithms and Tools

Sequential Consistency for Heterogeneous-Race-Free: Programmer-centric Memory Models for Heterogeneous Platforms

Sequential Monte Carlo Optimisation for Air Traffic Management

Serial and Parallel Bayesian Spam Filtering using Aho-Corasick and PFAC

Serpent encryption algorithm implementation on Compute Unified Device Architecture (CUDA)

Serverless Computing Strategies on Cloud Platforms

Serving LLMs in HPC Clusters: A Comparative Study of Qualcomm Cloud AI 100 Ultra and High-Performance GPUs

SESH framework: A Space Exploration Framework for GPU Application and Hardware Codesign

Sgap: Towards Efficient Sparse Tensor Algebra Compilation for GPU

SGO: An ultrafast engine for atomic structure global optimization by differential evolution

SGPU 2: a runtime system for using large applications on clusters of hybrid nodes

Shader Performance Analysis on a Modern GPU Architecture

Titles: 100
open PDFs: 95
packages: 11
