Papers on hgpu.org (.txt-file)
Data-Oriented Language Implementation of Lattice-Boltzmann Method for Dense and Sparse Geometries
Data-parallel Acceleration of PARSEC Black-Scholes Benchmark
Data-parallel algorithms and data structures
Data-parallel algorithms for large-scale real-time simulation of the cellular potts model on graphics processing units
Data-Parallel Construction of delta_N-Nets with Maximum Dispersion
Data-Parallel Flattening by Expansion
Data-Parallel Hashing Techniques for GPU Architectures
Data-Parallel Octrees for Surface Reconstruction
Data-Parallelism and GPUs for Lattice Gas Fluid Simulations
Data-rich astronomy: mining synoptic sky surveys
Database Operation Development on the GPU
Dataflow-based Design and Implementation of Image Processing Applications
Dataflow-Based Implementation of Layered Sensing Applications
Dataflow-driven GPU performance projection for multi-kernel transformations
Dataloader Parameter Tuner: An Automated Dataloader Parameter Tuner for Deep Learning Models
Daubechies wavelets for high performance electronic structure calculations: The BigDFT project
Dawn of GPU Era-Potentials of Chaos Theory
DawnCC: a Source-to-Source Automatic Parallelizer of C and C++ Programs
Dax Toolkit: A Proposed Framework for Data Analysis and Visualization at Extreme Scale
DBCSR: A Library for Dense Matrix Multiplications on Distributed GPU-Accelerated Systems
DBMS Index for Hierarchical Data Using Nested Intervals and Residue Classes
DC Power Flow Based Contingency Analysis Using Graphics Processing Units
DC Power Flow Based Contingency Analysis Using Graphics Processing Units (thesis)
DCT-JPEG Image Coding Based on GPU
dCUDA: hardware supported overlap of computation and communication
De-specializing an HLS library for Deep Neural Networks: improvements upon hls4ml
Dealing With Big Data Outside Of The Cloud: GPU Accelerated Sort
Debugging GPU stream programs through automatic dataflow recording and visualization
Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU
DecGPU: distributed error correction on massively parallel graphics processing units using CUDA and MPI
Declarative Parallel Programming for GPUs
Decoding with Finite-State Transducers on GPUs
Decompiling x86 Deep Neural Network Executables
Decoupled Access/Execute Metaprogramming for GPU-Accelerated Systems
Decoupled Block-Wise ILU(k) Preconditioner on GPU
Decoupled Deferred Shading for Hardware Rasterization
Decoupled Vector-Fetch Architecture with a Scalarizing Compiler
Decoupling Algorithms from Schedules for Easy Optimization of Image Processing Pipelines
Decoupling algorithms from the organization of computation for high performance image processing
Decreasing NAME III Solution Time Using GP-GPU
Decryption-decompression of AES protected ZIP files on GPUs
Deductive verification for SYCL
Deep and Shallow convections in Atmosphere Models on Intel Xeon Phi Coprocessor Systems
Deep Architectures for Neural Machine Translation
Deep Big Simple Neural Nets Excel on Handwritten Digit Recognition
Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition
Deep Convolutional Network evaluation on the Intel Xeon Phi: Where Subword Parallelism meets Many-Core
Deep convolutional networks for pancreas segmentation in CT imaging
Deep Convolutional Neural Networks for Smile Recognition
Deep Dynamic Neural Networks for Gesture Segmentation and Recognition
Deep Feature-based Face Detection on Mobile Devices
Deep Fluids: A Generative Network for Parameterized Fluid Simulations
Deep Graph Learning for Program Analysis and System Optimization
Deep Graph Library Optimizations for Intel(R) x86 Architecture
Deep Language Models for Software Testing and Optimisation
Deep Learning Application in Plant Stress Imaging: A Review
Deep Learning Approaches to Source Code Analysis for Optimization of Heterogeneous Systems: Recent Results, Challenges and Opportunities
Deep Learning At Scale and At Ease
Deep Learning Based FPGA-CPU Acceleration
Deep Learning by Doing: The NVIDIA Deep Learning Institute and University Ambassador Program
Deep Learning for Computational Chemistry
Deep Learning for Computer Vision: A comparison between Convolutional Neural Networks and Hierarchical Temporal Memories on object recognition tasks
Deep Learning for Digital Asset Limit Order Books
Deep learning for galaxy surface brightness profile fitting
Deep Learning for Mortgage Risk
Deep Learning for Obfuscated Code Analysis
Deep Learning For Smile Recognition
Deep Learning in the Automotive Industry: Applications and Tools
Deep Learning Models on CPUs: A Methodology for Efficient Training
Deep Learning on FPGAs: Past, Present, and Future
Deep learning review and its applications
Deep learning with COTS HPC systems
Deep Learning Workload Scheduling in GPU Datacenters: A Survey
Deep learning: A guide for practitioners in the physical sciences
Deep Neural Machine Translation with Weakly-Recurrent Units
Deep neural networks for direct, featureless learning through observation: the case of 2d spin models
Deep Neural Networks to Enable Real-time Multimessenger Astrophysics
Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups
Deep Shadow Maps from Volumetric Data on the GPU
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Deep Tensor Convolution on Multicores
Deep Voice 3: 2000-Speaker Neural Text-to-Speech
Deep Voice: Real-time Neural Text-to-Speech
Deep-Edge: An Efficient Framework for Deep Learning Model Update on Heterogeneous Edge
Deep, Big, Simple Neural Nets for Handwritten Digit Recognition
Deep, Dense, and Low-Rank Gaussian Conditional Random Fields
DeepAxe: A Framework for Exploration of Approximation and Reliability Trade-offs in DNN Accelerators
DeepBach: a Steerable Model for Bach chorales generation
DeepBE: Learning Deep Binary Encoding for Multi-Label Classification
DeepDSL: A Compilation-based Domain-Specific Language for Deep Learning
Titles: 100
open PDFs: 98
packages: 24