Papers on hgpu.org (.txt-file)
Data-Driven Analysis and Design of Vulkan Ray-Tracing Applications using Automatic Instrumentation

Data-Driven Dynamic Autotuning: Optimizing Autotuning Overhead with Prior Tuning Data

Data-driven Forecasting of Deep Learning Performance on GPUs

Data-driven Performance Optimization for Data-intensive Applications

Data-Driven Programming Abstractions and Optimization for Multi-Core Platforms

Data-driven versus Topology-driven Irregular Computations on GPUs

Data-efficient LLM Fine-tuning for Code Generation

Data-intensive document clustering on GPU clusters

Data-intensive document clustering on graphics processing unit (GPU) clusters

Data-Oriented Language Implementation of Lattice-Boltzmann Method for Dense and Sparse Geometries

Data-parallel Acceleration of PARSEC Black-Scholes Benchmark

Data-parallel algorithms and data structures

Data-parallel algorithms for large-scale real-time simulation of the cellular Potts model on graphics processing units

Data-Parallel Construction of delta_N-Nets with Maximum Dispersion

Data-Parallel Flattening by Expansion

Data-Parallel Hashing Techniques for GPU Architectures

Data-Parallel Octrees for Surface Reconstruction

Data-Parallelism and GPUs for Lattice Gas Fluid Simulations

Data-rich astronomy: mining synoptic sky surveys

Database Operation Development on the GPU

Dataflow-based Design and Implementation of Image Processing Applications

Dataflow-Based Implementation of Layered Sensing Applications

Dataflow-driven GPU performance projection for multi-kernel transformations

Dataloader Parameter Tuner: An Automated Dataloader Parameter Tuner for Deep Learning Models

Dato: A Task-Based Programming Model for Dataflow Accelerators

Daubechies wavelets for high performance electronic structure calculations: The BigDFT project

Dawn of GPU Era-Potentials of Chaos Theory

DawnCC: a Source-to-Source Automatic Parallelizer of C and C++ Programs

Dax Toolkit: A Proposed Framework for Data Analysis and Visualization at Extreme Scale

DBCSR: A Library for Dense Matrix Multiplications on Distributed GPU-Accelerated Systems

DBMS Index for Hierarchical Data Using Nested Intervals and Residue Classes

DC Power Flow Based Contingency Analysis Using Graphics Processing Units

DC Power Flow Based Contingency Analysis Using Graphics Processing Units (thesis)

DCT-JPEG Image Coding Based on GPU

dCUDA: hardware supported overlap of computation and communication

De-specializing an HLS library for Deep Neural Networks: improvements upon hls4ml

Dealing With Big Data Outside Of The Cloud: GPU Accelerated Sort

Debugging GPU stream programs through automatic dataflow recording and visualization

Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU

Debunking the CUDA Myth Towards GPU-based AI Systems

DecGPU: distributed error correction on massively parallel graphics processing units using CUDA and MPI

Declarative Parallel Programming for GPUs

Decoding with Finite-State Transducers on GPUs

Decompiling x86 Deep Neural Network Executables

Decoupled Access/Execute Metaprogramming for GPU-Accelerated Systems

Decoupled Block-Wise ILU(k) Preconditioner on GPU

Decoupled Deferred Shading for Hardware Rasterization

Decoupled Vector-Fetch Architecture with a Scalarizing Compiler

Decoupling Algorithms from Schedules for Easy Optimization of Image Processing Pipelines

Decoupling algorithms from the organization of computation for high performance image processing

Decreasing NAME III Solution Time Using GP-GPU

Decryption-decompression of AES protected ZIP files on GPUs

Deductive verification for SYCL

Deep and Shallow convections in Atmosphere Models on Intel Xeon Phi Coprocessor Systems

Deep Architectures for Neural Machine Translation

Deep Big Simple Neural Nets Excel on Handwritten Digit Recognition

Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition

Deep Convolutional Network evaluation on the Intel Xeon Phi: Where Subword Parallelism meets Many-Core

Deep convolutional networks for pancreas segmentation in CT imaging

Deep Convolutional Neural Networks for Smile Recognition

Deep Dynamic Neural Networks for Gesture Segmentation and Recognition

Deep Feature-based Face Detection on Mobile Devices

Deep Fluids: A Generative Network for Parameterized Fluid Simulations

Deep Graph Learning for Program Analysis and System Optimization

Deep Graph Library Optimizations for Intel(R) x86 Architecture

Deep Language Models for Software Testing and Optimisation

Deep Learning and Machine Learning with GPGPU and CUDA: Unlocking the Power of Parallel Computing

Deep Learning Application in Plant Stress Imaging: A Review

Deep Learning Approaches to Source Code Analysis for Optimization of Heterogeneous Systems: Recent Results, Challenges and Opportunities

Deep Learning At Scale and At Ease

Deep Learning Based FPGA-CPU Acceleration

Deep Learning by Doing: The NVIDIA Deep Learning Institute and University Ambassador Program

Deep Learning for Computational Chemistry

Deep Learning for Computer Vision: A comparison between Convolutional Neural Networks and Hierarchical Temporal Memories on object recognition tasks

Deep Learning for Digital Asset Limit Order Books

Deep learning for galaxy surface brightness profile fitting

Deep Learning for Mortgage Risk

Deep Learning for Obfuscated Code Analysis

Deep Learning For Smile Recognition

Deep Learning in the Automotive Industry: Applications and Tools

Deep Learning Inference on Heterogeneous Mobile Processors: Potentials and Pitfalls

Deep Learning Model Security: Threats and Defenses

Deep Learning Models on CPUs: A Methodology for Efficient Training

Deep Learning on FPGAs: Past, Present, and Future

Deep learning review and its applications

Deep learning with COTS HPC systems

Deep Learning Workload Scheduling in GPU Datacenters: A Survey

Deep learning: A guide for practitioners in the physical sciences

Deep Neural Machine Translation with Weakly-Recurrent Units

Deep neural networks for direct, featureless learning through observation: the case of 2d spin models

Titles: 100
open PDFs: 98
packages: 22
