1173

Papers on hgpu.org (.txt-file)

Surface Compression Using Dynamic Color Palettes Download

Surface Normal Integration for Convex Space-time Multi-view Reconstruction Download

Surface quality assessment of subdivision surfaces on programmable graphics hardware Download

Surface Reconstruction from Scattered Point via RBF Interpolation on GPU Download

Survey and Benchmarking of Machine Learning Accelerators Download

Survey of Domain-Specific Languages for FPGA Computing Download

Survey of GPU water simulation in game engine

Survey on Benchmarks for a GPU Based Multi Camera Stereo Matching Algorithm Download

Survey on Efficient Linear Solvers for Porous Media Flow Models on Recent Hardware Architectures Download

Survey On The Off-Chip Scheduling of Memory Accesses in the Memory Interface Of GPUs Download

Survey paper on Deep Learning on GPUs Download

Sustainable GPU Computing at Scale Download

Sustainable Supercomputing for AI: GPU Power Capping at HPC Scale Download

SW# – GPU enabled exact alignments on genome scale Download Package

SW#db: GPU-accelerated exact sequence similarity database search Download Package

Swan: A tool for porting CUDA programs to OpenCL Download Package

SWAPHI: Smith-Waterman Protein Database Search on Xeon Phi Coprocessors Download Package

Swarm-NG: a CUDA Library for Parallel n-body Integrations with focus on Simulations of Planetary Systems Download Package

Swarm’s flight: Accelerating the particles using C-CUDA

swCaffe: a Parallel Framework for Accelerating Deep Learning Applications on Sunway TaihuLight Download Package

swCUDA: Auto parallel code translation framework from CUDA to ATHREAD for new generation sunway supercomputer Download

Swendsen-Wang Multi-Cluster Algorithm for the 2D/3D Ising Model on Xeon Phi and GPU Download

Swept Volume approximation of polygon soups Download

SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences Download Package

Switching to High Gear: Opportunities for Grand-Scale Real-Time Parallel Simulations Download

Swizzle Inventor: Data Movement Synthesis for GPU Kernels Download

SWM: Simplified Wu-Manber for GPU-based Deep Packet Inspection Download

SWPS3 – fast multi-threaded vectorized Smith-Waterman for IBM Cell/B.E. and x86/SSE2 Download Package

SYCL Code Generation for Multigrid Methods Download Package

SYCL compute kernels for ExaHyPE Download Package

SYCL in the edge: performance and energy evaluation for heterogeneous acceleration Download Package

SYCL in the Edge: Performance Evaluation for Heterogeneous Acceleration Download Package

SYCL-Bench 2020: Benchmarking SYCL 2020 on AMD, Intel, and NVIDIA GPUs Download Package

SYCL-Bench: A Versatile Cross-Platform Benchmark Suite for Heterogeneous Computing Download Package

SYCL-Bench: A Versatile Single-Source Benchmark Suite for Heterogeneous Computing Download Package

SYCLops: A SYCL Specific LLVM to MLIR Converter Download

Sylkan: Towards a Vulkan Compute Target Platform for SYCL Download

Symbolic Crosschecking of Data-Parallel Floating Point Code Download Package

Symbolic crosschecking of floating-point and SIMD code Download Package

Symbolic Differentiation in GPU Shaders Download

Symbolic Testing of OpenCL Code Download Package

Symphony: A Scheduler for Client-Server Applications on Coprocessor-based Heterogeneous Clusters Download

Synchronization and Coordination in Heterogeneous Processors Download

Synchronization and Ordering Semantics in Hybrid MPI+GPU Programming Download

Synergia CUDA: GPU-accelerated accelerator modeling package Download Package

Synergistic CPU-FPGA Acceleration of Sparse Linear Algebra Download

Synergistic execution of stream programs on multicores with accelerators Download Package

SYnergy: Fine-grained Energy-Efficient Heterogeneous Computing for Scalable Energy Saving Download

Synkhronos: a Multi-GPU Theano Extension for Data Parallelism Download Package

Synthesis and rendering of bidirectional texture functions on arbitrary surfaces Download

Synthesis of Custom Networks of Heterogeneous Processing Elements for Complex Physical System Emulation Download

Synthesis of Embedded Software using Dataflow Schedule Graphs Download

Synthesis of GPU Programs from High-Level Models Download

Synthesis of Platform Architectures from OpenCL Programs Download

Synthesizing Benchmarks for Predictive Modeling Download Package

Synthesizing Software from a ForSyDe Model Targeting GPGPUs Download

Synthesizing Structured Traversals from Attribute Grammars Download

Synthesizing Subdivision Meshes Using Real Time Tessellation Download

Synthetic Aperture Beamformation using the GPU Download

Synthetic Aperture Radar imaging on a CUDA-enabled mobile platform Download

Synthetic Aperture Radar Processing with GPGPU Download

Syntix: A Profiling Based Resource Estimator for CUDA Kernels Download

System Design Principles for Heterogeneous Resource Management and Scheduling in Accelerator-Based Systems Download

System integration of FastSPECT III, a dedicated SPECT rodent-brain imager based on BazookaSPECT detector technology Download

System-Level Optimization and Code Generation for Graphics Processors using a Domain-Specific Language Download Package

Systematic Approach in Optimizing Numerical Memory-Bound Kernels on GPU Download

Systematic construction, verification and implementation methodology for LDPC codes Download

Systematic Performance Optimization of Cone-Beam Back-Projection on the Kepler Architecture Download

Systematic Physics Constrained Parameter Estimation of Stochastic Differential Equations Download

SystemC simulation on GP-GPUs: CUDA vs. OpenCL Download

Systolic-CNN: An OpenCL-defined Scalable Run-time-flexible FPGA Accelerator Architecture for Accelerating Convolutional Neural Network Inference in Cloud/Edge Computing Download Package

SZx: an Ultra-fast Error-bounded Lossy Compressor for Scientific Datasets Download

TABLA: A Unified Template-based Framework for Accelerating Statistical Machine Learning Download

Tabu Search on GPU Download

Tabu Search with two approaches to parallel flowshop evaluation on CUDA platform

Tackling Exascale Software Challenges in Molecular Dynamics Simulations with GROMACS Download

Tactics to Directly Map CNN graphs on Embedded FPGAs Download Package

Taichi: A Language for High-Performance Computation on Spatially Sparse Data Structures Download Package

Takagi Factorization on GPU using CUDA Download

Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes Download

Taking the graphics processor beyond graphics Download

Taming irregular EDA applications on GPUs

Taming the complexities of the C11 and OpenCL memory models Download Package

Tamp: A Library for Compact Deep Neural Networks with Structured Matrices Download Package

Tangible video teleconference system using real-time image-based relighting Download

Tango: A Deep Neural Network Benchmark Suite for Various Accelerators Download Package

Tangram: a High-level Language for Performance Portable Code Synthesis Download

TAP: A TLP-Aware Cache Management Policy for a CPU-GPU Heterogeneous Architecture Download

Tapping the supercomputer under your desk: Solving dynamic equilibrium models with graphics processors Download Package

Tapping the supercomputer under your desk: solving dynamic equilibrium models with graphics processors? Download Package

Target Marker: A Visual Marker for Long Distances and Detection in Realtime on Mobile Devices Download

targetDP: an Abstraction of Lattice Based Parallelism with Portable Performance Download

Targeting GPUs with OpenMP Directives on Summit: A Simple and Effective Fortran Experience Download Package

Targeting heterogeneous architectures via macro data flow Download

Task and Data Distribution in Hybrid Parallel Systems Download

Task management for irregular-parallel workloads on the GPU Download

Task migration of DSP application specified with a DFG and implemented with the BSP computing model on a CPU-GPU cluster Download

Task Parallel Incomplete Cholesky Factorization using 2D Partitioned-Block Layout Download Package

Task Parallelism and Data Distribution: An Overview of Explicit Parallel Programming Languages Download

Task Parallelism and Synchronization: An Overview of Explicit Parallel Programming Languages Download

 

Brief statistics for this page

Titles: 100

Download open PDFs: 96

Package packages: 33

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: