Papers on hgpu.org (.txt-file)
Sparse Matrix Multiplication using CUDA and Mex Interface
Sparse matrix partitioning for optimizing SpMV on CPU-GPU heterogeneous platforms
Sparse matrix solvers on the GPU: conjugate gradients and multigrid
Sparse Matrix-Matrix Multiplication on Multilevel Memory Architectures : Algorithms and Experiments
Sparse matrix-vector multiplication on GPGPU clusters: A new storage format and a scalable implementation
Sparse Matrix-Vector Multiplication on GPGPUs
Sparse Matrix-Vector Multiplication on GPU
Sparse Matrix-Vector Multiplication on NVIDIA GPU
Sparse Recovery on GPUs: Accelerating the Iterative Soft-Thresholding Algorithm
Sparse regularization in MRI iterative reconstruction using GPUs
Sparse systems solving on GPUs with GMRES
Sparse Winograd Convolutional neural networks on small-scale systolic arrays
Sparse-Matrix support for the SkePU library for portable CPU/GPU programming
Sparse-Matrix-CG-Solver in CUDA
Sparselet Models for Efficient Multiclass Object Detection
Sparser, Better, Faster GPU Parsing
Spatial Data Structures, Sorting and GPU Parallelism for Situated-agent Simulation and Visualisation
Spatial Indexing of Large-Scale Geo-Referenced Point Data on GPGPUs Using Parallel Primitives
Spatial interpolation in massively parallel computing environments
Spatial interpolation of scattered geoscientific data
Spatial Join with R-Tree on Graphics Processing Units
Spatial Sorting Algorithms for Parallel Computing in Networks
Spatial splits in bounding volume hierarchies
Spatial: A Language and Compiler for Application Accelerators
Spatio-temporal upsampling on the GPU
Spatter: A Benchmark Suite for Evaluating Sparse Access Patterns
Special Relativistic Visualization by Local Ray Tracing
Specification and verification of GPGPU programs
Specification and Verification of GPGPU Programs using Permission-Based Separation Logic
Speckle Reduction with Trained Nonlinear Diffusion Filtering
Spectral classification using convolutional neural networks
Spectral Ewald Acceleration of Stokesian Dynamics for polydisperse suspensions
Spectral Method Characterization on FPGA and GPU Accelerators
Spectral volume rendering using GPU-based raycasting
Specular Effects on the GPU: State of the Art
Speculative Execution of Parallel Programs with Precise Exception Semantics on GPUs
Speculative Execution on GPU: An Exploratory Study
Speculative Execution on Multi-GPU Systems
Speculative Parallel Evaluation Of Classification Trees On GPGPU Compute Engines
Speculative Parallelization on GPGPUs
Speculative Segmented Sum for Sparse Matrix-Vector Multiplication on Heterogeneous Processors
Specx: a C++ task-based runtime system for heterogeneous distributed architectures
Speech Recognition on Modern Graphic Processing Units
Speech Recognition on Multi-Core Processors and GPUs
Speed and Portability issues for Random Number Generation on Graphical Processing Units with CUDA and other Processing Accelerators
Speed sign detection and recognition by convolutional neural networks
Speed up Large Integer Multiplication Using Fourier Transforms and CUDA Technology
Speed-Up Improvement Using Parallel Approach in Image Steganography
Speed, power and cost implications for GPU acceleration of Computational Fluid Dynamics on HPC systems
Speeding up a few orders of magnitude the Jacobi method: high order Chebyshev-Jacobi over GPUs
Speeding up a Video Summarization Approach Using GPUs and Multicore CPUs
Speeding up Automatic Hyperparameter Optimization of Deep Neural Networks by Extrapolation of Learning Curves
Speeding Up Computer Vision Applications on Mobile Computing Platforms
Speeding Up Cycle Based Logic Simulation Using Graphics Processing Units
Speeding Up Geospatial Polygon Rasterization on GPGPUs
Speeding Up Homomorpic Hashing Using GPUs
Speeding up K-Means Algorithm by GPUs
Speeding up Large-Scale Point-in-Polygon Test Based Spatial Join on GPUs
Speeding up lattice sieve with Xeon Phi coprocessor
Speeding up LIP-Canny with CUDA programming
Speeding Up Model Building for ECGA on CUDA Platform
Speeding up Mutual Information Computation Using NVIDIA CUDA Hardware
Speeding Up Object Detection: Fast Resizing in the Integral Image Domain
Speeding Up Particle Trajectory Simulations under Moving Force Fields using GPUs
Speeding Up Reinforcement Learning with Graphics Processing Units
Speeding up Scoring Module of Mass Spectrometry Based Protein Identification by GPU
Speeding up subset seed algorithm for intensive protein sequence comparison
Speeding up the evaluation of evolutionary learning systems using GPGPUs
Speeding up the evaluation phase of GP classification algorithms on GPUs
Speeding up the MATLAB complex networks package using graphic processors
Speeding up the MATLAB Hyperspectral Image Analysis Toolbox using GPUs and the Jacket Toolbox
Speeding up the small progress measures algorithm for parity games using the GPU
Speeding-up Pearson Correlation Coefficient calculation on graphical processing units
Speeding-up the Verification Phase of Set Similarity Joins in the GPGPU paradigm
Speedup and Parallelization Models for Energy-Efficient Many-Core Systems Using Performance Counters
Speedup for quantum optimal control from GPU-based automatic differentiation
Speedup of Fuzzy Clustering Through Stream Processing on Graphics Processing Units
Speedup of Micromagnetic Simulations with C++ AMP On Graphics Processing Units
Speedup of Type-1 Fuzzy Logic Systems on Graphics Processing Units Using CUDA
Speedups between x70 and x120 for a generic local search (memetic) algorithm on a single GPGPU chip
sPEGG: high throughput eco-evolutionary simulations on commodity graphics processors
SPH Based Fluid Animation Using CUDA Enabled GPU
SPH Fluids for Viscous Jet Buckling
Spherical harmonic transform on heterogeneous architectures using hybrid programming
Spherical harmonic transform with GPUs
Spiking Neural Networks for Real-Time Infrared Images Processing in Thermo Vision Systems
SPIRE, a Sequential to Parallel Intermediate Representation Extension
Split tiling for GPUs: automatic parallelization using trapezoidal tiles
Splotch: porting and optimizing for the Xeon Phi
SpMV: A Memory-Bound Application on the GPU Stuck Between a Rock and a Hard Place
SPOC: GPGPU Programming Through Stream Processing With OCaml
Sponge: portable stream programming on graphics engines
Spotting Radio Transients with the help of GPUs
SPRAT: Runtime processor selection for energy-aware computing
Spring-Bead Animation of Viscoelastic Materials
Springald: GPU-Accelerated Window-Based Aggregates Over Out-of-Order Data Streams
Spyx: A Library for Just-In-Time Compiled Optimization of Spiking Neural Networks
SqueezCL: Squeezing OpenCL Kernels for Approximate Computing on Contemporary GPUs
Titles: 100
open PDFs: 94
packages: 20