1173

Papers on hgpu.org (.txt-file)

vSMC: Parallel Sequential Monte Carlo in C++ Download Package

Vulkan 1.1.97 – A Specification (with all registered Vulkan extensions) Download

Vulnerability Analysis and Attacks on Intel Xeon Phi Coprocessor Download

Vulnerable GPU Memory Management: Towards Recovering Raw Data from GPU Download

Wait-free programming for general purpose computations on graphics processors Download

waLBerla: A block-structured high-performance framework for multiphysics simulations Download Package

Walle: An End-to-End, General-Purpose, and Large-Scale Production System for Device-Cloud Collaborative Machine Learning Download

Wanted: Floating-Point Add Round-off Error instruction Download Package

Warp Size Impact in GPUs: Large or Small? Download

Warp-Level Divergence in GPUs: Characterization, Impact, and Mitigation Download

Warp-Level Parallelism: Enabling Multiple Replications In Parallel on GPU Download

WarpCore: A Library for fast Hash Tables on GPUs Download

WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU Download Package

Warped Register File: A Power Efficient Register File for GPGPUs Download

Warps and Atomics: Beyond Barrier Synchronization in the Verification of GPU Kernels Download Package

Wasserstein-Fisher-Rao Document Distance Download

Waste Not, Want Not! Managing relational data in asymmetric memories Download

Waste Not… Efficient Co-Processing of Relational Data Download

Water simulation based on HLSL Download

Water simulation for cell based sandbox games Download

Water Surface Animation using Damped Wave Equation and CUDA Acceleration Download

wav2letter++: The Fastest Open-source Speech Recognition System Download Package

Wave field synthesis for 3D audio: architectural prospectives Download

Wavefront raycasting using larger filter kernels for on-the-fly GPU gradient reconstruction

Wavelet Encoding and Multi-GPU Programming Download

Wavelet Model-based Stereo for Fast, Robust Face Reconstruction Download

WAYPOINT: scaling coherence to thousand-core architectures Download

WCCV: Improving the Vectorization of IF-statements with Warp-Coherent Conditions Download Package

Weak execution ordering – exploiting iterative methods on many-core GPUs Download

WebCL for Hardware-Accelerated Web Applications Download Package

Weighted Block-Asynchronous Iteration on GPU-Accelerated Systems Download

Weighted Residuals for Very Deep Networks Download

What you see is what you snap: snapping to geometry deformed on the GPU Download

When HLS Meets FPGA HBM: Benchmarking and Bandwidth Optimization Download

When Machine Learning Meets Quantum Computers: A Case Study Download

Where is the data? Why you cannot debate CPU vs. GPU performance without the answer Download

Whippletree: Task-based Scheduling of Dynamic Workloads on the GPU Download Package

Whole-function vectorization Download

Why does PHM matter? – Nvidia’s GPU problems reviewed

Why is FPGA-GPU Heterogeneity the Best Option for Embedded Deep Neural Networks? Download

Why it is time for a HyPE: A Hybrid Query Processing Engine for Efficient GPU Coprocessing in DBMS Download

Wideband Channelization for Software-Defined Radio via Mobile Graphics Processors Download

Wilson and Domainwall Kernels on Oakforest-PACS Download Package

Winograd Algorithm for AdderNet Download

Wire Speed Name Lookup: A GPU-based Approach Download

Wireless Interference Identification with Convolutional Neural Networks Download

word2ket: Space-efficient Word Embeddings inspired by Quantum Entanglement Download

Work Efficient Parallel Algorithms for Large Graph Exploration Download

Work in Progress: Vortex Detection and Visualization for Design of Micro Air Vehicles and Turbomachinery Download

Work Stealing Inside GPUs Download

Work-Efficient Parallel GPU Methods for Single-Source Shortest Paths Download Package

Working With Incremental Spatial Data During Parallel (GPU) Computation Download Package

Workload Analysis and Efficient OpenCL-based Implementation of SIFT Algorithm on a Smartphone Download

Workload and network-optimized computing systems

Workload Aware Algorithms for Heterogeneous Platforms Download

Workload Balancing on Heterogeneous Systems: A Case Study of Sparse Grid Interpolation Download

Workload Characterization of 3D Games Download

Workload distribution and balancing in FPGAs and CPUs with OpenCL and TBB Download

Workload Scheduling on Heterogeneous Devices Download

Workload-aware Automatic Parallelization for Multi-GPU DNN Training Download

Worst-Case Execution Time Guarantees for Runtime-Reconfigurable Architectures Download

WPA/WPA2 Password Security Testing using Graphics Processing Units Download

Wrinkling Coarse Meshes on the GPU Download

Writing a modular GPGPU program in Java Download

Writing a performance-portable matrix multiplication Download Package

Writing self-adaptive codes for heterogeneous systems Download Package

X-Device Query Processing by Bitwise Distribution Download

X-ray CT on the GPU Download

X-toon: an extended toon shader Download

XBOOLE-CUDA: Fast Boolean Operations on the GPU Download

Xbox 360 System Architecture Download

Xbox360 Front Side Bus – A 21.6 GB/s End-to-End Interface Design Download

Xeon Phi: A comparison between the newly introduced MIC architecture and a standard CPU through three types of problems Download Package

XeonPhi Meets Astrophysical Fluid Dynamics Download

XGBoost: Scalable GPU Accelerated Learning Download Package

XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures Download Package

XMalloc: A Scalable Lock-free Dynamic Memory Allocator for Many-core Machines Download

XML3D: interactive 3D graphics for the web Download Package

XMT-GPU: A PRAM Architecture for Graphics Computation Download

XSD: Accelerating MapReduce by Harnessing the GPU inside an SSD Download

YaDiV-an open platform for 3D visualization and 3D segmentation of medical data Download Package

Yang-Mills lattice on CUDA Download

YodaNN: An Ultra-Low Power Convolutional Neural Network Accelerator Based on Binary Weights Download

You Can Type, but You Can’t Hide: A Stealthy GPU-based Keylogger Download

Ypnos: declarative, parallel structured grid programming Download

ytopt: Autotuning Scientific Applications for Energy Efficiency at Large Scales Download Package

ZAME: Interactive Large-Scale Graph Visualization Download

Zero-copy I/O processing for low-latency GPU computing Download

Zeus: Understanding and Optimizing GPU Energy Consumption of DNN Training Download Package

Zippy: A Framework for Computation and Visualization on a GPU Cluster

ZNN – A Fast and Scalable Algorithm for Training 3D Convolutional Networks on Multi-Core and Many-Core Shared Memory Machines Download Package

Zorua: Enhancing Programming Ease, Portability, and Performance in GPUs by Decoupling Programming Models from Resource Management Download

ZUCL: A ZYNQ UltraScale+ Framework for OpenCL HLS Applications Download Package

 

Brief statistics for this page

Titles: 93

Download open PDFs: 89

Package packages: 23

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: