Papers on hgpu.org (.txt-file)

Rubus: A compiler for seamless and extensible parallelism Download Package

RUMD: A general purpose molecular dynamics package optimized to utilize GPU hardware down to a few thousand particles Download Package

Run-time Image and Video Resizing Using CUDA-enabled GPUs Download

Run-time Reconfigurable Multiprocessors Download

Run-time support for multi-level disjoint memory address spaces Download

Run, Stencil, Run! – A Comparison of Modern Parallel Programming Paradigms Download

Running Financial Risk Management Applications on FPGA in the Amazon Cloud Download

Running the NIM Next-Generation Weather Model on GPUs

Running unstructured grid-based CFD solvers on modern graphics hardware Download

Running unstructured grid-based CFD solvers on modern graphics hardware Download

Runtime Code Generation and Data Management for Heterogeneous Computing in Java Download

Runtime Comparison of CPU and GPU Using Portable Programming Models Download

Runtime Compilation of Array-Oriented Python Programs Download

Runtime Configurable Deep Neural Networks for Energy-Accuracy Trade-off Download

Runtime Performances Benchmark for Knowledge Graph Embedding Methods Download Package

Runtime Specialization for Heterogeneous CPU-GPU Platforms Download

Runtime Support for Adaptive Power Capping on Heterogeneous SoCs Download

Runtime Support for Performance Portability on Heterogeneous Distributed Platforms Download

Runtime Support toward Transparent Memory Access in GPU-accelerated Heterogeneous Systems Download

Runtime Systems and Scheduling Support for High-End CPU-GPU Architectures Download

Runtime Visualization of Application Progress and Monitoring of a GPU-enabled Parallel Environment Download

S-buffer: Sparsity-aware Multi-fragment Rendering Download Package

SABER: Window-Based Hybrid Stream Processing for Heterogeneous Architectures Download

SaberLDA: Sparsity-Aware Learning of Topic Models on GPUs Download

Saddle Vertex Graph (SVG): A Novel Solution to the Discrete Geodesic Problem Download

Safe and Practical GPU Acceleration in TrustZone Download

Safe Asynchronous Multicore Memory Operations Download

Safe, Seamless, And Scalable Integration Of Asynchronous GPU Streams In PETSc Download Package

SafeGPU: Contract- and Library-Based GPGPU for Object-Oriented Languages Download Package

SAGA: SystemC Acceleration on GPU Architectures Download

SAGE: Self-Tuning Approximation for Graphics Engines Download

SAIH: A Scalable Evaluation Methodology for Understanding AI Performance Trend on HPC Systems Download

Sailfish: a flexible multi-GPU implementation of the lattice Boltzmann method Download Package

SaLoBa: Maximizing Data Locality and Workload Balance for Fast Sequence Alignment on GPUs Download

Salus: Fine-Grained GPU Sharing Primitives for Deep Learning Applications Download Package

Sample distribution shadow maps Download

SAPPORO: A way to turn your graphics cards into a GRAPE-6 Download Package

Sapporo2: A versatile direct N-body library Download Package

SAR focusing of P-band ice sounding data using back-projection Download

SAR raw signal simulation based on GPU parallel computation

Sawtooth Wavefront Reordering: Enhanced CuTile FlashAttention on NVIDIA GB10 Download

SBArt4 – Breeding abstract animations in realtime Download

SBLOCK: A Framework for Efficient Stencil-Based PDE Solvers on Multi-core Platforms

SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing Download

Scalability Analysis of Parallel Algorithms on GPU Clusters Download

Scalability Analysis of Synchronous Data-Parallel Artificial Neural Network (ANN) Learners Download

Scalability and Optimization Strategies for GPU Enhanced Neural Networks (GeNN) Download Package

Scalability Evaluation of HPC Multi-GPU Training for ECG-based LLMs Download

Scalability of Higher-Order Discontinuous Galerkin FEM Computations for Solving Electromagnetic Wave Propagation Problems on GPU Clusters

Scalability of Incompressible Flow Computations on Multi-GPU Clusters Using Dual-Level and Tri-Level Parallelism Download

Scalability of Self-organizing Maps on a GPU cluster using OpenCL and CUDA Download

Scalability Study of Deep Learning Algorithms in High Performance Computer Infrastructures Download

Scalable Access-Pattern Aware I/O Acceleration and Multi-Tiered Data Management for HPC and AI Workloads Download Package

Scalable and deterministic timing-driven parallel placement for FPGAs Download Package

Scalable and High Performance Betweenness Centrality on the GPU Download Package

Scalable and highly parallel implementation of Smith-Waterman on graphics processing unit using CUDA

Scalable and Interactive Segmentation and Visualization of Neural Processes in EM Datasets Download

Scalable and massively parallel Monte Carlo photon transport simulations for heterogeneous computing platforms Download Package

Scalable and Parallel Implementation of a Financial Application on a GPU: With Focus on Out-of-Core Case

Scalable Applications on Heterogeneous System Architectures: A Systematic Performance Analysis Framework Download

Scalable approximate k-NN in multidimensional big data Download

Scalable Breadth-First Search on a GPU Cluster Download

Scalable Clustering for Vision using GPUs Download

Scalable Clustering Using Graphics Processors Download

Scalable communication for high-order stencil computations using CUDA-aware MPI Download

Scalable Data Clustering using GPU Clusters Download

Scalable Dense Linear Algebra on Heterogeneous Hardware Download

Scalable Distributed DNN Training using TensorFlow and CUDA-Aware MPI: Characterization, Designs, and Performance Evaluation Download

Scalable Distributed Fast Multipole Methods Download

Scalable Engine and the Performance of Different LLM Models in a SLURM based HPC architecture Download

Scalable Fast Multipole Methods on Distributed Heterogeneous Architectures Download

Scalable Fast Multipole Methods on Heterogeneous Architecture Download

Scalable framework for mapping streaming applications onto multi-GPU systems Download

Scalable GPU Acceleration of B-Spline Signal Processing Operations Download

Scalable GPU rendering of CSG models Download

Scalable GPU-Based Integrity Verification for Large Machine Learning Models Download Package

Scalable heterogeneous parallelism for atmospheric modeling and simulation Download

Scalable instruction set simulator for thousand-core architectures running on GPGPUs Download

Scalable Kernel Fusion for Memory-Bound GPU Applications Download

Scalable Lattice Boltzmann Solvers for CUDA GPU Clusters Download

Scalable learning for object detection with GPU hardware Download

Scalable Metropolis Monte Carlo for simulation of hard shapes Download

Scalable Molecular Dynamics Simulation Using FPGAs and Multicore Processors Download

Scalable Multi Agent Simulation on the GPU Download

Scalable Multi-Cache Simulation Using GPUs Download

Scalable Multi-GPU 3-D FFT for TSUBAME 2.0 Supercomputer Download

Scalable multi-GPU implementation of the MAGFLOW simulator Download

Scalable Multi-GPU Simulation of Long-Range Molecular Dynamics Download

Scalable packet classification via GPU metaprogramming

Scalable Parallel Minimum Spanning Forest Computation Download

Scalable parallel programming with CUDA Download

Scalable Parallel Tridiagonal Algorithms with Diagonal Pivoting and Their Optimization for Many-Core Architectures Download

Scalable Programming Models for Massively Multicore Processors

Scalable Query Evaluation in Relational Databases Download

Scalable Simulation of 3D Wave Propagation in Semi-Infinite Domains Using the Finite Difference Method on a GPU Based Cluster Download

Scalable Simulation of Tsunamis Generated by Submarine Landslides on GPU clusters Download

Scalable SMT-based verification of GPU kernel functions Download Package

Scalable Software Defined FM-radio receiver running on desktop computers

Scalable software defined receivers running on desktop computers using General Purpose Graphics Processing Units

Scalable Solution of Radiative Heat Transfer Problems by the Photon Monte Carlo Algorithm on Hybrid Computing Architectures Download

 

Brief statistics for this page

Titles: 100

Duplicates: 1

Download open PDFs: 90

Packages: 17

* * *

HGPU group © 2010-2026 hgpu.org

All rights belong to the respective authors
