Papers on hgpu.org (.txt-file)
DeeperLab: Single-Shot Image Parser
DeepfakeUCL: Deepfake Detection via Unsupervised Contrastive Learning
DeepFont: Identify Your Font from An Image
DeepLearningKit – an GPU Optimized Deep Learning Framework for Apple’s iOS, OS X and tvOS developed in Metal and Swift
DeepLearningKit – an Open Source Deep Learning Framework for Apple’s iOS, OS X and tvOS developed in Metal and Swift
DeepMetabolism: A Deep Learning System to Predict Phenotype from Genome Sequencing
DeepMon: Mobile GPU-based Deep Learning Framework for Continuous Vision Applications
DeepProf: Performance Analysis for Deep Learning Applications via Mining GPU Execution Patterns
DeepPy: Pythonic deep learning
DeepSeek-Coder: When the Large Language Model Meets Programming – The Rise of Code Intelligence
DeepSmith: Compiler Fuzzing through Deep Learning
DeepSpark: Spark-Based Deep Learning Supporting Asynchronous Updates and Caffe Compatibility
DeepSpeech: Scaling up end-to-end speech recognition
DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression
DeepX: A Software Accelerator for Low-Power Deep Learning Inference on Mobile Devices
DEF-G: Declarative Framework for GPU Environment
Defocus Magnification with CUDA
Deformable model collision detection using A-buffer
Deformable object simulation in virtual environment
Deformation modeling using global medial representation structures and evaluation by biset mesh matching
Deformation of skeleton based implicit objects
Deforming a High-Resolution Mesh in Real-Time by Mapping onto a Low-Resolution Physical Model
Delaunay Triangulation in R3 on the GPU
Delivering Performance-Portable Stencil Computations on CPUs and GPUs Using Bricks
Delta-stepping: a parallelizable shortest path algorithm
DEM based simulation of concrete structures on GPU
Democratic Population Decisions Result in Robust Policy-Gradient Learning: A Parametric Study with GPU Simulations
Democratizing General Purpose GPU Programming through OpenCL and Scala
Demonstrating Self-Learning Algorithm Adaptivity in a Hardware-Oblivious Database Engine
Demystifying Dependency Bugs in Deep Learning Stack
Demystifying GPU microarchitecture through microbenchmarking
Demystifying the MLPerf Benchmark Suite
Demystifying the Nvidia Ampere Architecture through Microbenchmarking and Instruction-level Analysis
Denoising Volumetric Data on GPU
Dense and sparse parallel linear algebra algorithms on graphics processing units
Dense Arithmetic over Finite Fields with the CUMODP Library
Dense Dynamic Programming on Multi GPU
Dense Linear Algebra on Distributed Heterogeneous Hardware with a Symbolic DAG Approach
Dense linear algebra solvers for multicore with GPU accelerators
Dense Matrix Algebra on the GPU
Dense Matrix Computation on a Heterogenous Architecture: A Block Synchronous Approach
Dense optical flow by iterative local window registration
Dense photometric stereo reconstruction on many core GPUs
Dense Photometric Stereo: A Markov Random Field Approach
Dense point trajectories by GPU-accelerated large displacement optical flow
Dense Real-Time Mapping of Object-Class Semantics from RGB-D Video
Dense Symmetric Indefinite Factorization on GPU Accelerated Architectures
DenseCut: Densely Connected CRFs for Realtime GrabCut
Density Estimations for Approximate Query Processing on SIMD Architectures
Density functional theory calculation on many-cores hybrid central processing unit-graphic processing unit architectures
Density Functional Theory calculation on many-cores hybrid CPU-GPU architectures
Density-based clustering using graphics processors
Density-based parallel skin lesion border detection with webCL
Deploying Graph Algorithms on GPUs: an Adaptive Solution
Deployment of CPU and GPU-based genetic programming on heterogeneous devices
Deployment of parallel linear genetic programming using GPUs on PC and video game console platforms
Depth Estimation using Open Compute Language (OpenCL)
Depth Images: Representations and Real-Time Rendering
Depth Map Based Superresolution Method in 3D Reconstruction
Depth map enhanced macroblock partitioning for H.264 video coding of computer graphics content
Depth-Dependent Halos: Illustrative Rendering of Dense Line Data
Depth-First Search versus Jurema Search on GPU Branch-and-Bound Algorithms: a case study
Depth-of-Field Blur Effects for First-Person Navigation in Virtual Environments
Deriving Shape Grammars on the GPU
Descend: A Safe GPU Systems Programming Language
Design and Analysis of Soft-Error Resilience Mechanisms for GPU Register File
Design and Development of an Efficient H.264 Video Encoder for CPU/GPU using OpenCL
Design and Development of Optical Flow Based Obstacle Avoidance Using CUDA
Design and evaluation of a parallel k-nearest neighbor algorithm on CUDA-enabled GPU
Design and Evaluation of Scalable Concurrent Queues for Many-Core Architectures
Design and implementation of a high-performance stream-based computing platform on multigenerational GPUs
Design and Implementation of a PTX Emulation Library
Design and Implementation of Centrally-Coordinated Peer-to-Peer Live-streaming
Design and Implementation of CNN-FPGA accelerator based on Open Computing Language
Design and Implementation of GPU-Based Prim’s Algorithm
Design and implementation of MPEG audio layer III decoder using graphics processing units
Design and Implementation of ShenWei Universal C/C++
Design and implementation of software-managed caches for multicores with local memory
Design and Implementation of the Futhark Programming Language
Design and implementation of the Smith-Waterman algorithm on the CUDA-compatible GPU
Design and Modeling of a Non-blocking Checkpointing System
Design and optimization of a portable LQCD Monte Carlo code using OpenACC
Design and optimization of DBSCAN Algorithm based on CUDA
Design and Optimization of Hybrid MD5-Blowfish Encryption on GPUs
Design and Optimization of Image Processing Algorithms on Mobile GPU
Design and Optimization of OpenFOAM-based CFD Applications for Hybrid and Heterogeneous HPC Platforms
Design and Optimization of OpenFOAM-based CFD Applications for Modern Hybrid and Heterogeneous HPC Platforms
Design and Performance Analysis of Parallel Processing of SRTP Packets
Design and performance evaluation of a digital wideband receiver on a hybrid computing platform
Design and Performance Evaluation of a Software Framework for Multi-Physics Simulations on Heterogeneous Supercomputers
Design and Performance Evaluation of Image Processing Algorithms on GPUs
Design and Performance Evaluation of Optimizations for OpenCL FPGA Kernels
Design and Performance of the OP2 Library for Unstructured Mesh Applications
Design and Storage Optimization of GPU-based Parallel Program of Image Registration for Remote Sensing
Design and study of a massively multi threaded shared memory architecture
Design Exploration of AES Accelerators on FPGAs and GPUs
Titles: 100
open PDFs: 90
packages: 21