Papers on hgpu.org (.txt-file)
Deep Graph Learning for Program Analysis and System Optimization
Deep Graph Library Optimizations for Intel(R) x86 Architecture
Deep Language Models for Software Testing and Optimisation
Deep Learning and Machine Learning with GPGPU and CUDA: Unlocking the Power of Parallel Computing
Deep Learning Application in Plant Stress Imaging: A Review
Deep Learning Approaches to Source Code Analysis for Optimization of Heterogeneous Systems: Recent Results, Challenges and Opportunities
Deep Learning At Scale and At Ease
Deep Learning Based FPGA-CPU Acceleration
Deep Learning by Doing: The NVIDIA Deep Learning Institute and University Ambassador Program
Deep Learning for Computational Chemistry
Deep Learning for Computer Vision: A comparison between Convolutional Neural Networks and Hierarchical Temporal Memories on object recognition tasks
Deep Learning for Digital Asset Limit Order Books
Deep learning for galaxy surface brightness profile fitting
Deep Learning for Mortgage Risk
Deep Learning for Obfuscated Code Analysis
Deep Learning For Smile Recognition
Deep Learning in the Automotive Industry: Applications and Tools
Deep Learning Inference on Heterogeneous Mobile Processors: Potentials and Pitfalls
Deep Learning Models on CPUs: A Methodology for Efficient Training
Deep Learning on FPGAs: Past, Present, and Future
Deep learning review and its applications
Deep learning with COTS HPC systems
Deep Learning Workload Scheduling in GPU Datacenters: A Survey
Deep learning: A guide for practitioners in the physical sciences
Deep Neural Machine Translation with Weakly-Recurrent Units
Deep neural networks for direct, featureless learning through observation: the case of 2d spin models
Deep Neural Networks to Enable Real-time Multimessenger Astrophysics
Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups
Deep Shadow Maps from Volumetric Data on the GPU
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Deep Tensor Convolution on Multicores
Deep Voice 3: 2000-Speaker Neural Text-to-Speech
Deep Voice: Real-time Neural Text-to-Speech
Deep-Edge: An Efficient Framework for Deep Learning Model Update on Heterogeneous Edge
Deep, Big, Simple Neural Nets for Handwritten Digit Recognition
Deep, Dense, and Low-Rank Gaussian Conditional Random Fields
DeepAxe: A Framework for Exploration of Approximation and Reliability Trade-offs in DNN Accelerators
DeepBach: a Steerable Model for Bach chorales generation
DeepBE: Learning Deep Binary Encoding for Multi-Label Classification
DeepDSL: A Compilation-based Domain-Specific Language for Deep Learning
DeeperLab: Single-Shot Image Parser
DeepfakeUCL: Deepfake Detection via Unsupervised Contrastive Learning
DeepFont: Identify Your Font from An Image
DeepLearningKit – an GPU Optimized Deep Learning Framework for Apple’s iOS, OS X and tvOS developed in Metal and Swift
DeepLearningKit – an Open Source Deep Learning Framework for Apple’s iOS, OS X and tvOS developed in Metal and Swift
DeepMetabolism: A Deep Learning System to Predict Phenotype from Genome Sequencing
DeepMon: Mobile GPU-based Deep Learning Framework for Continuous Vision Applications
DeepProf: Performance Analysis for Deep Learning Applications via Mining GPU Execution Patterns
DeepPy: Pythonic deep learning
DeepSeek-Coder: When the Large Language Model Meets Programming – The Rise of Code Intelligence
DeepSmith: Compiler Fuzzing through Deep Learning
DeepSpark: Spark-Based Deep Learning Supporting Asynchronous Updates and Caffe Compatibility
DeepSpeech: Scaling up end-to-end speech recognition
DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression
DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices
DEF-G: Declarative Framework for GPU Environment
Defocus Magnification with CUDA
Deformable model collision detection using A-buffer
Deformable object simulation in virtual environment
Deformation modeling using global medial representation structures and evaluation by biset mesh matching
Deformation of skeleton based implicit objects
Deforming a High-Resolution Mesh in Real-Time by Mapping onto a Low-Resolution Physical Model
Delaunay Triangulation in R3 on the GPU
Delivering Performance-Portable Stencil Computations on CPUs and GPUs Using Bricks
Delta-stepping: a parallelizable shortest path algorithm
DEM based simulation of concrete structures on GPU
Democratic Population Decisions Result in Robust Policy-Gradient Learning: A Parametric Study with GPU Simulations
Democratizing General Purpose GPU Programming through OpenCL and Scala
Demonstrating Self-Learning Algorithm Adaptivity in a Hardware-Oblivious Database Engine
Demystifying Dependency Bugs in Deep Learning Stack
Demystifying GPU microarchitecture through microbenchmarking
Demystifying the MLPerf Benchmark Suite
Demystifying the Nvidia Ampere Architecture through Microbenchmarking and Instruction-level Analysis
Denoising Volumetric Data on GPU
Dense and sparse parallel linear algebra algorithms on graphics processing units
Dense Arithmetic over Finite Fields with the CUMODP Library
Dense Dynamic Programming on Multi GPU
Dense Linear Algebra on Distributed Heterogeneous Hardware with a Symbolic DAG Approach
Dense linear algebra solvers for multicore with GPU accelerators
Dense Matrix Algebra on the GPU
Dense Matrix Computation on a Heterogenous Architecture: A Block Synchronous Approach
Dense optical flow by iterative local window registration
Dense photometric stereo reconstruction on many core GPUs
Dense Photometric Stereo: A Markov Random Field Approach
Dense point trajectories by GPU-accelerated large displacement optical flow
Dense Real-Time Mapping of Object-Class Semantics from RGB-D Video
Dense Symmetric Indefinite Factorization on GPU Accelerated Architectures
DenseCut: Densely Connected CRFs for Realtime GrabCut
Density Estimations for Approximate Query Processing on SIMD Architectures
Density functional theory calculation on many-cores hybrid central processing unit-graphic processing unit architectures
Density Functional Theory calculation on many-cores hybrid CPU-GPU architectures
Density-based clustering using graphics processors
Density-based parallel skin lesion border detection with webCL
Deploying Graph Algorithms on GPUs: an Adaptive Solution
Titles: 100
open PDFs: 96
packages: 30