1173

Papers on hgpu.org (.txt-file)

Delta-stepping: a parallelizable shortest path algorithm

DEM based simulation of concrete structures on GPU Download

DEMCMC-GPU: An Efficient Multi-Objective Optimization Method with GPU Acceleration on the Fermi Architecture

Democratic Population Decisions Result in Robust Policy-Gradient Learning: A Parametric Study with GPU Simulations Download Package

Democratizing General Purpose GPU Programming through OpenCL and Scala Download Package

Demonstrating Self-Learning Algorithm Adaptivity in a Hardware-Oblivious Database Engine Download

Demystifying Cost-Efficiency in LLM Serving over Heterogeneous GPUs Download Package

Demystifying Dependency Bugs in Deep Learning Stack Download

Demystifying GPU microarchitecture through microbenchmarking Download

Demystifying the MLPerf Benchmark Suite Download Package

Demystifying the Nvidia Ampere Architecture through Microbenchmarking and Instruction-level Analysis Download

Denoising Volumetric Data on GPU Download

Dense and sparse parallel linear algebra algorithms on graphics processing units Download

Dense Arithmetic over Finite Fields with the CUMODP Library Download Package

Dense Dynamic Programming on Multi GPU Download

Dense Linear Algebra on Distributed Heterogeneous Hardware with a Symbolic DAG Approach Download

Dense linear algebra solvers for multicore with GPU accelerators Download Package

Dense Matrix Algebra on the GPU Download

Dense Matrix Computation on a Heterogenous Architecture: A Block Synchronous Approach Download Package

Dense optical flow by iterative local window registration

Dense photometric stereo reconstruction on many core GPUs Download

Dense Photometric Stereo: A Markov Random Field Approach Download

Dense point trajectories by GPU-accelerated large displacement optical flow Download

Dense Real-Time Mapping of Object-Class Semantics from RGB-D Video Download

Dense Symmetric Indefinite Factorization on GPU Accelerated Architectures Download

DenseCut: Densely Connected CRFs for Realtime GrabCut Download Package

Density Estimations for Approximate Query Processing on SIMD Architectures Download Package

Density functional theory calculation on many-cores hybrid central processing unit-graphic processing unit architectures Download

Density Functional Theory calculation on many-cores hybrid CPU-GPU architectures Download

Density-based clustering using graphics processors Download

Density-based parallel skin lesion border detection with webCL Download

Dependable Embedded Systems Download

Deploying Graph Algorithms on GPUs: an Adaptive Solution Download

Deployment of CPU and GPU-based genetic programming on heterogeneous devices

Deployment of parallel linear genetic programming using GPUs on PC and video game console platforms Download

Depth Enhanced Panoramas Download

Depth Estimation using Open Compute Language (OpenCL) Download

Depth Images: Representations and Real-Time Rendering Download

Depth Map Based Superresolution Method in 3D Reconstruction Download

Depth map enhanced macroblock partitioning for H.264 video coding of computer graphics content Download

Depth-Dependent Halos: Illustrative Rendering of Dense Line Data Download

Depth-First Search versus Jurema Search on GPU Branch-and-Bound Algorithms: a case study Download

Depth-of-Field Blur Effects for First-Person Navigation in Virtual Environments Download

Deriving Shape Grammars on the GPU Download

Descend: A Safe GPU Systems Programming Language Download Package

Design and Analysis of Soft-Error Resilience Mechanisms for GPU Register File Download

Design and Development of an Efficient H. 264 Video Encoder for CPU/GPU using OpenCL Download

Design and Development of Optical Flow Based Obstacle Avoidance Using CUDA Download

Design and evaluation of a parallel k-nearest neighbor algorithm on CUDA-enabled GPU

Design and Evaluation of Scalable Concurrent Queues for Many-Core Architectures Download

Design and implementation of a high-performance stream-based computing platform on multigenerational GPUs Download

Design and Implementation of a PTX Emulation Library Download

Design and implementation of a time-division multiplexing scan architecture using serializer and deserializer in GPU chips

Design and Implementation of Centrally-Coordinated Peer-to-Peer Live-streaming Download

Design and Implementation of CNN-FPGA accelerator based on Open Computing Language Download Package

Design and Implementation of GPU-Based Prim’s Algorithm Download

Design and implementation of MPEG audio layer III decoder using graphics processing units

Design and Implementation of ShenWei Universal C/C++ Download

Design and implementation of software-managed caches for multicores with local memory Download

Design and Implementation of the Futhark Programming Language Download Package

Design and implementation of the Smith-Waterman algorithm on the CUDA-compatible GPU Download

Design and Modeling of a Non-blocking Checkpointing System Download

Design and optimization of a portable LQCD Monte Carlo code using OpenACC Download

Design and optimization of DBSCAN Algorithm based on CUDA Download

Design and Optimization of Hybrid MD5-Blowfish Encryption on GPUs Download

Design and Optimization of Image Processing Algorithms on Mobile GPU Download

Design and Optimization of OpenFOAM-based CFD Applications for Hybrid and Heterogeneous HPC Platforms Download

Design and Optimization of OpenFOAM-based CFD Applications for Modern Hybrid and Heterogeneous HPC Platforms Download

Design and Performance Analysis of Parallel Processing of SRTP Packets Download

Design and performance evaluation of a digital wideband receiver on a hybrid computing platform

Design and Performance Evaluation of a Software Framework for Multi-Physics Simulations on Heterogeneous Supercomputers Download

Design and Performance Evaluation of Image Processing Algorithms on GPUs

Design and Performance Evaluation of Optimizations for OpenCL FPGA Kernels Download

Design and Performance of the OP2 Library for Unstructured Mesh Applications Download Package

Design and Storage Optimization of GPU-based Parallel Program of Image Registration for Remote Sensing Download

Design and study of a massively multi threaded shared memory architecture Download

Design Exploration of AES Accelerators on FPGAs and GPUs Download

Design Exploration of Quadrature Methods in Option Pricing Download

Design of 3D FFT on Multi-GPU Clusters Download

Design of a fully programmable shader processor for low power mobile devices

Design of a Hybrid Memory System for General-Purpose Graphics Processing Units Download

Design of a parallel AES for graphics hardware using the CUDA framework Download

Design of a programmable micro-ultrasound research platform Download

Design of an FPGA-Based FDTD Accelerator Using OpenCL Download

Design of FPGA-Based Accelerator for Convolutional Neural Network under Heterogeneous Computing Framework with OpenCL Download

Design of Hardware Accelerator for Lempel-Ziv 4 (LZ4) Compression Download

Design of high-performance parallelized gene predictors in MATLAB Download

Design of MILC Lattice QCD Application for GPU Clusters Download

Design optimization of automotive electronic control unit using the analysis of common-mode current by fast electromagnetic field solver

Design Principles for Sparse Matrix Multiplication on the GPU Download

Design Space Exploration for GPU-Based Architecture Download

Design Space Exploration of an OpenCL Based SAXPY Kernel Implementation on FPGAs Download

Design Space Exploration of Concurrency Mapping to FPGAs in Weather and Climate Applications with Xilinx SDSoC OpenCL, SDSoC C++ and Vivad Download

Design Space Exploration of OpenCL Applications on Heterogeneous Parallel Platforms Download

Design Space Exploration of Real-time Bedside and Portable Medical Ultrasound Adaptive Beamformer Acceleration Download

Design space exploration towards a realtime and energy-aware GPGPU-based analysis of biosensor data Download

Design Tools for Accelerating Development and Usage of Multi-Core Computing Platforms Download

Design, Implementation and Performance Evaluation of a Stochastic Gradient Descent Algorithm on CUDA Download

Design, Implementation and Test of Efficient GPU to GPU Communication Methods Download

Design, Optimization, and Benchmarking of Dense Linear Algebra Algorithms on AMD GPUs Download

 

Brief statistics for this page

Titles: 100

Download open PDFs: 89

Package packages: 13

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org