high performance computing on graphics processing units: hgpu.org

Papers on hgpu.org (.txt-file)

Visibility Cuts: A System for Rendering Dynamic Virtual Environments

Visibility Sampling on GPU and Applications

Vision based Navigation (VBN) of Unmanned Aerial Vehicles (UAV)

Vispark: GPU-Accelerated Distributed Visual Computing Using Spark

VisPy: Harnessing The GPU For Fast, High-Level Visualization

Visual Analysis Algorithms for Embedded Systems

Visual Computing in Biology and Medicine: Interactive visual analysis of contrast-enhanced ultrasound data based on small neighborhood statistics

Visual cortex on the GPU: Biologically inspired classifier and feature descriptor for rapid recognition

Visual Data Mining Using the Point Distribution Tensor

Visual Human – Machine Learning

Visual Performance Analysis of Memory Behavior in a Task-Based Runtime on Hybrid Platforms

Visual Signatures in Video Visualization

Visual Simulation of Breaking Waves in Shallow Water

Visual Simulation of Flow

Visual Simulation of Heat Shimmering and Mirage

Visual simulation of shockwaves

Visual simulation of thermal fluid dynamics in a pressurized water reactor

Visual system design for excavator simulator with deformable terrain

Visual-model-based, real-time 3D pose tracking for autonomous navigation: methodology and experiments

Visual, Spatial and Temporal Quality in Video-Based Reconstruction of People: Achieving, Prototyping and Evaluating

Visualisation of Physical Lung Simulation: an Interactive Application to Assist Physicians

Visualising Interfaces in Scalar and Vector Field-Model Simulations

Visualising spins and clusters in regular and small-world Ising models with GPUs

Visualization and Analysis of GPU Summer School Applicants and Participants

Visualization and Correction of Automated Segmentation, Tracking and Lineaging from 5-D Stem Cell Image Sequences

Visualization and GPU-accelerated simulation of medical ultrasound from CT images

Visualization assisted by parallel processing

Visualization in the Einstein Year 2005: a case study on explanatory and illustrative visualization of relativity and astrophysics

Visualization of Astronomical Nebulae via Distributed Multi-GPU Compressed Sensing Tomography

Visualization of Fibrous and Thread-like Data

Visualization of large multidimensional data sets by using multi-core CPU, GPU and MPI cluster

Visualization of Large Volumetric Multi-Channel Microscopy Data Streams on Standard PCs

Visualization of level-of-detail meshes on the GPU

Visualization of LIDAR datasets using point-based rendering technique

Visualization of OpenCL Application Execution on CPU-GPU Systems

Visualization of Pareto Solutions by Spherical Self-Organizing Map and Its acceleration on a GPU

Visualization of structured nonuniform grids

Visualization Tool for GPGPU Programming

Visualization with stylized line primitives

Visualizing and Analyzing the Mona Lisa

Visualizing complex dynamics in many-core accelerator architectures

Visualizing Complex Functions Using GPUs

Visualizing Multiwavelength Astrophysical Data

Visualizing the Radiation of the Kelvin-Helmholtz Instability

Visualizing Trends on Twitter

VitBit: Enhancing Embedded GPU Performance for AI Workloads through Register Operand Packing

Vivaldi: A Domain-Specific Language for Volume Processing and Visualization on Distributed Heterogeneous Systems

Vlasov on GPU (VOG Project)

VOCL: An Optimized Environment for Transparent Virtualization of Graphics Processing Units

Voice Command Recognition with Dynamic Time Warping (DTW) using Graphics Processing Units (GPU) with Compute Unified Device Architecture (CUDA)

VolQD: Direct Volume Rendering of Multi-million Atom Quantum Dot Simulations

Volume and Isosurface Rendering with GPU-Accelerated Cell Projection

Volume exploration using ellipsoidal Gaussian transfer functions

Volume Raycasting Performance Using DirectCompute

Volume rendering visualization of 3D spherical mantle convection with an unstructured mesh

Volume Visualization: A Technical Overview with a Focus on Medical Applications

Volume-preserving FFD for programmable graphics hardware

Volumetric Ambient Occlusion

Volumetric Ambient Occlusion for Real-Time Rendering and Games

Volumetric Rendering Techniques for Scientific Visualization

Voreen: A Rapid-Prototyping Environment for Ray-Casting-Based Volume Visualizations

Voronoi Toolpaths for PCB Mechanical Etch: Simple and Intuitive Algorithms with the 3D GPU

Vortex Methods for Fluid Simulation in Computer Graphics

Vortex methods for incompressible flow simulations on the GPU

Vortex particle method and parallel computing

Vortex: Overcoming Memory Capacity Limitations in GPU-Accelerated Large-Scale Data Analytics

Voxelized Minkowski sum computation on the GPU with robust culling

VoxelPipe: a programmable pipeline for 3D voxelization

Voxels on fire

VSIPL++ Acceleration Using Commodity Graphics Processors

vSMC: Parallel Sequential Monte Carlo in C++

Vulkan 1.1.97 – A Specification (with all registered Vulkan extensions)

Vulnerability Analysis and Attacks on Intel Xeon Phi Coprocessor

Vulnerable GPU Memory Management: Towards Recovering Raw Data from GPU

Wait-free programming for general purpose computations on graphics processors

waLBerla: A block-structured high-performance framework for multiphysics simulations

Walle: An End-to-End, General-Purpose, and Large-Scale Production System for Device-Cloud Collaborative Machine Learning

Wanted: Floating-Point Add Round-off Error instruction

Warp Size Impact in GPUs: Large or Small?

Warp-Level Divergence in GPUs: Characterization, Impact, and Mitigation

Warp-Level Parallelism: Enabling Multiple Replications In Parallel on GPU

WarpCore: A Library for fast Hash Tables on GPUs

WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU

Warped Register File: A Power Efficient Register File for GPGPUs

Warps and Atomics: Beyond Barrier Synchronization in the Verification of GPU Kernels

Wasserstein-Fisher-Rao Document Distance

Waste Not, Want Not! Managing relational data in asymmetric memories

Waste Not… Efficient Co-Processing of Relational Data

Water simulation based on HLSL

Water simulation for cell based sandbox games

Water Surface Animation using Damped Wave Equation and CUDA Acceleration

wav2letter++: The Fastest Open-source Speech Recognition System

Wave field synthesis for 3D audio: architectural prospectives

Wavefront raycasting using larger filter kernels for on-the-fly GPU gradient reconstruction

Wavelet Encoding and Multi-GPU Programming

Wavelet Model-based Stereo for Fast, Robust Face Reconstruction

WAYPOINT: scaling coherence to thousand-core architectures

WCCV: Improving the Vectorization of IF-statements with Warp-Coherent Conditions

Weak execution ordering – exploiting iterative methods on many-core GPUs

WebCL for Hardware-Accelerated Web Applications

Brief statistics for this page

Titles: 100

Download open PDFs: 86

Packages: 14

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?

PELSI: Power-Efficient Layer-Switched Inference

Efficient deep learning inference on end devices

Ouroboros: Virtualized Queues for dynamic memory management

Dynamic Memory Management on GPUs with SYCL

* * *

Recent source codes

XaaS containers

microSYCL: SYCL micro-benchmarks repository

SYCL Container

CASS: Cuda-Amd aSSembly

Cluster of smartphones for edge computing application using TensorFlow

CFAL-bench
