1173

Papers on hgpu.org (.txt-file)

Design of a fully programmable shader processor for low power mobile devices

Design of a Hybrid Memory System for General-Purpose Graphics Processing Units Download

Design of a parallel AES for graphics hardware using the CUDA framework Download

Design of a programmable micro-ultrasound research platform Download

Design of an FPGA-Based FDTD Accelerator Using OpenCL Download

Design of FPGA-Based Accelerator for Convolutional Neural Network under Heterogeneous Computing Framework with OpenCL Download

Design of Hardware Accelerator for Lempel-Ziv 4 (LZ4) Compression Download

Design of high-performance parallelized gene predictors in MATLAB Download

Design of MILC Lattice QCD Application for GPU Clusters Download

Design optimization of automotive electronic control unit using the analysis of common-mode current by fast electromagnetic field solver

Design Principles for Sparse Matrix Multiplication on the GPU Download

Design Space Exploration for GPU-Based Architecture Download

Design Space Exploration of an OpenCL Based SAXPY Kernel Implementation on FPGAs Download

Design Space Exploration of Concurrency Mapping to FPGAs in Weather and Climate Applications with Xilinx SDSoC OpenCL, SDSoC C++ and Vivad Download

Design Space Exploration of OpenCL Applications on Heterogeneous Parallel Platforms Download

Design Space Exploration of Real-time Bedside and Portable Medical Ultrasound Adaptive Beamformer Acceleration Download

Design space exploration towards a realtime and energy-aware GPGPU-based analysis of biosensor data Download

Design Tools for Accelerating Development and Usage of Multi-Core Computing Platforms Download

Design, Implementation and Performance Evaluation of a Stochastic Gradient Descent Algorithm on CUDA Download

Design, Implementation and Test of Efficient GPU to GPU Communication Methods Download

Design, Optimization, and Benchmarking of Dense Linear Algebra Algorithms on AMD GPUs Download

Designing a high-performance boundary element library with OpenCL and Numba Download Package

Designing a Modern Skeleton Programming Framework for Parallel and Heterogeneous Systems Download

Designing a Unified Programming Model for Heterogeneous Machines Download

Designing and optimizing compute kernels on NVIDIA GPUs

Designing Bit-Reproducible Portable High-Performance Applications Download

Designing Efficient Barriers and Semaphores for Graphics Processing Units Download

Designing Efficient Many-Core Parallel Algorithms for All-Pairs Shortest-Paths Using CUDA

Designing Efficient MPI and UPC Runtime for Multicore Clusters with InfiniBand, Accelerators and Co-Processors Download

Designing efficient sorting algorithms for manycore GPUs Download

Designing Fast Architecture Sensitive Tree Search on Modern Multi-Core/Many-Core Processors Download

Designing Fast LTL Model Checking Algorithms for Many-Core GPUs Download

Designing Numerical Solvers for Next Generation High Performance Computing Download

Designing OP2 for GPU architectures Download Package

Designing scalable many-core parallel algorithms for min graphs using CUDA

Designing Scientific Applications on GPUs Download Package

Designing the Language Liszt for Building Portable Mesh-based PDE Solvers Download Package

Detecting Computer Viruses using GPUs Download

Detecting Data Races on OpenCL Kernels with Symbolic Execution Download

Detecting multiple periodicities in observational data with the multi-frequency periodogram. II. Frequency Decomposer, a parallelized time-series analysis algorithm Download Package

Detecting parametric objects in large scenes by Monte Carlo sampling Download

Detection of a faint fast-moving near-Earth asteroid using synthetic tracking technique Download

Detection of collisions and self-collisions using image-space techniques Download

Detection of retransmissions in 10G Ethernet using GPUs Download

Determinant Computation on the GPU using the Condensation Method Download

Determining the difficulty of accelerating problems on a GPU Download

Deterministic Parallelism Download

Deterministic Sample Sort For GPUs Download

Developing a compiler for the XeonPhi Download

Developing a CUDA solver for large sparse matrices for MARIN Download

Developing a High Performance GPGPU Compiler Using Cetus Download Package

Developing a High Performance Software Library with MPI and CUDA for Matrix Computations Download

Developing a massive real-time crowd simulation framework on the GPU Download

Developing a New Storage Format and a Warp-Based SpMV Kernel for Configuration Interaction Sparse Matrices on the GPU Download

Developing acquisition systems based on FPGA with OpenCL Download

Developing an OO Model for Generalized Matrix Multiplication: Preliminary Considerations Download

Developing and Deploying Advanced Algorithms to Novel Supercomputing Hardware Download

Developing and Evaluating clOpenCL Applications for Heterogeneous Clusters Download

Developing Extensible Lattice-Boltzmann Simulators for General-Purpose Graphics-Processing Units Download

Developing Performance-Portable Molecular Dynamics Kernels in OpenCL Download Package

Development and evaluation of a GPU-optimized N-body term for the simulation of biomolecules Download

Development and evaluation of scalable video motion estimators on GPU

Development methodologies for GPU and cluster of GPUs Download

Development of a Chemically Reacting Flow Solver on the Graphic Processing Units Download

Development of a CUDA Implementation of the 3D FDTD Method Download

Development of a Flow Solver with Complex Kinetics on the Graphic Processing Units Download

Development of a GPU based two-way time transfer modem

Development of a GPU-accelerated MIKE 21 Solver for Water Wave Dynamics Download

Development of a GPU-based High-Performance Radiative Transfer Model for the Infrared Atmospheric Sounding Interferometer (IASI)

Development of a GPU-based Monte Carlo dose calculation code for coupled electron-photon transport Download

Development of a GPU-based multithreaded software application to calculate digitally reconstructed radiographs for radiotherapy Download

Development of a Restricted Additive Schwarz Preconditioner for Sparse Linear Systems on NVIDIA GPU Download

Development of a volume rendering system using 3D texture compression techniques on general-purpose personal computers Download

Development of an Algorithm for Extracting Parallelism and Pipeline Structure from Stream-based Processing flow with Spanning Tree Download

Development of an explicit pressure-based unstructured solver for three-dimensional incompressible flows with graphics hardware acceleration Download

Development of an unified FDTD-FEM library for electromagnetic analysis with CPU and GPU computing Download

Development of Bayesian analysis program for extraction of polarisation observables at CLAS Download Package

Development of Generic Scheduling Concepts for OpenGL ES 2.0 Download

Development of High-Performance Software Components for Emerging Architectures Download Package

Development of JavaScript-based deep learning platform and application to distributed training Download Package

Development of Krylov and AMG linear solvers for large-scale sparse matrices on GPUs Download

Development of methods for the processing of mining images using genetic algorithms Download

Development of nonlinear filter bank system for real-time beautification of facial video using GPGPU

Development of Parallel Architectures for Radar/Video Signal Processing Applications Download

Development of Parallel Computation Tools Download

Development of Virtual Machine Tool for Simulation and Evaluation Download

Developmental Directions in Parallel Accelerators Download

Device Placement Optimization with Reinforcement Learning Download

Device specialization in heterogeneous multi-GPU environments Download

Devito: automated fast finite difference computation Download Package

DFG Implementation on Multi GPU Cluster with Computation-Communication Overlap Download

DGEMM on Integer Matrix Multiplication Unit Download Package

Diagnosing Performance Bottlenecks in HPC Applications Download Package

Diagnosis, Tuning, and Redesign for Multicore Performance: A Case Study of the Fast Multipole Method Download

Diagrammatic Determinantal Quantum Monte Carlo Calculations on GPUs Download

DIANNE: Distributed Artificial Neural Networks for the Internet of Things Download Package

Diderot: A Parallel DSL for Image Analysis and Visualization Download Package

Different Optimization Strategies and Performance Evaluation of Reduction on Multicore CUDA Architecture Download

Differential evolution algorithm on the GPU with C-CUDA

Differential Evolution with parallelised objective functions using CUDA Download

 

Brief statistics for this page

Titles: 100

Download open PDFs: 90

Package packages: 15

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: