Papers on hgpu.org (.txt-file)

Towards Adaptive GPU Resource Management for Embedded Real-Time Systems Download

Towards Alignment of Parallelism in SYCL and ISO C++ Download

Towards an automatic generation of dense linear algebra solvers on parallel architectures Download

Towards an Effective Unified Programming Model for Many-Cores Download

Towards an embedded biologically-inspired machine vision processor Download

Towards an interactive and automated script feature analysis of 3D scanned cuneiform tablets Download

Towards automated kernel selection in machine learning systems: A SYCL case study Download Package

Towards Automated Learning of Object Detectors Download

Towards Automatic C Programs Optimization and Parallelization using the PIPS-PoCC Integration Download Package

Towards automatic Digital Surface Model generation using a Graphics Processing Unit

Towards Automatic Learning of Heuristics for Mechanical Transformations of Procedural Code Download

Towards Automatic Transformation of Legacy Scientific Code into OpenCL for Optimal Performance on FPGAs Download Package

Towards Automating Multi-dimensional Data Decomposition for Executing a Single-GPU Code on a Multi-GPU System Download

Towards autonomous resource management: Deep learning prediction of CPU-GPU load balancing Download

Towards Building Error Resilient GPGPU Applications Download

Towards Chip-on-Chip Neuroscience: Fast Mining of Frequent Episodes Using Graphics Processors Download

Towards chip-on-chip neuroscience: fast mining of neuronal spike streams using graphics hardware Download

Towards Co-execution on Commodity Heterogeneous Systems: Optimizations for Time-Constrained Scenarios Download Package

Towards Code Generation from Design Models for Embedded Systems on Heterogeneous CPU-GPU Platforms Download

Towards Comprehensive Parametric Code Generation Targeting Graphics Processing Units in Support of Scientific Computation Download

Towards Dense Linear Algebra for Hybrid GPU Accelerated Manycore Systems Download

Towards Distortion-Predictable Embedding of Neural Networks Download Package

Towards Distributed Heterogenous High-Performance Computing with ViennaCL Download Package

Towards Domain-specific Computing for Stencil Codes in HPC Download Package

Towards dynamic reconfigurable load-balancing for hybrid desktop platforms Download

Towards Efficient and Scalable Acceleration of Online Decision Tree Learning on FPGA Download

Towards Efficient GPU Sharing on Multicore Processors Download

Towards Efficient Indexing of Spatiotemporal Trajectories on the GPU for Distance Threshold Similarity Searches Download

Towards Efficient Large-Scale Graph Neural Network Computing Download

Towards Efficient Risk Quantification-Using GPUs and Variance Reduction Technique Download

Towards energy efficiency and productivity for decision making in mobile robot navigation Download

Towards Enhancing Performance, Programmability, and Portability in Heterogeneous Computing Download Package

Towards fast and certified multiple-precision libraries Download

Towards Faster Cloth Simulation: Examining the Preconditioned Conjugate Gradient Download

Towards fully user transparent task and data parallel image processing Download

Towards global composition of performance-aware components for GPU-based systems Download

Towards Good Practices for Very Deep Two-Stream ConvNets Download Package

Towards GPGPU Assisted Computing in Virtualized Environments

Towards GPU-Accelerated Large-Scale Graph Processing in the Cloud Download

Towards Green Computing: A Survey of Performance and Energy Efficiency of Different Platforms using OpenCL Download

Towards High Performance Java-based Deep Learning Frameworks Download Package

Towards High Speed Aerial Tracking of Agile Targets Download

Towards High-Performance and Cost-Effective Distributed Storage Systems with Information Dispersal Algorithms Download

Towards Improving Programmability of Heterogeneous Parallel Architectures Download

Towards Intelligent Runtime Framework for Distributed Heterogeneous Systems Download

Towards Interactive Visual Exploration of Parallel Programs using a Domain-specific Language Download

Towards Large-Scale Molecular Dynamics Simulations on Graphics Processors Download

Towards large-scale network analytics Download

Towards Lattice Quantum Chromodynamics on FPGA devices Download

Towards making the most of NLP-based device mapping optimization for OpenCL kernels Download

Towards Memory-Efficient Answering of Tree-Shaped SPARQL Queries using GPUs Download Package

Towards metaprogramming for parallel systems on a chip Download

Towards microsecond biological molecular dynamics simulations on hybrid processors

Towards Modeling Energy Consumption of Xeon Phi Download

Towards multi-GPU support for visualization Download

Towards Multi-GPU Support in the Marrow Skeleton Framework Download

Towards On-Chip Optical FFTs for Convolutional Neural Networks Download

Towards On-Line Digital Doubles Download

Towards paradisEO-MO-GPU: a framework for GPU-based local search metaheuristics Download Package

Towards Parallel Programming Models for Predictability Download

Towards Path Tracing in Games Download

Towards Performance Portable Programming for Distributed Heterogeneous Systems Download Package

Towards Performance-Aware Allocation for Accelerated Machine Learning on GPU-SSD Systems Download

Towards Performance-Portable, Scalable, and Convenient Linear Algebra Download

Towards Portable Performance for Explicit Hydrodynamics Codes Download Package

Towards Porting a Real-World Seismological Application to the Intel MIC Architecture Download

Towards Predictable Real-Time Performance on Multi-Core Platforms Download

Towards Rapid Prototyping of Parallel and HPC Applications (GPU Focus) Download

Towards real time 2D to 3D registration for ultrasound-guided endoscopic and laparoscopic procedures Download

Towards real time 3D tracking and reconstruction on a GPU using Monte Carlo simulations Download

Towards real time vision based UUV navigation using GPU technology

Towards real-time radiation therapy: GPU accelerated superposition/convolution Download

Towards real-time tomography: Fast reconstruction algorithms and GPU implementation

Towards reverse engineering the brain: Modeling abstractions and simulation frameworks Download

Towards robust automatic detection of vulnerable road users: monocular pedestrian tracking from a moving vehicle Download

Towards scalar synchronization in SIMT architectures Download

Towards shared memory consistency models for GPUs Download

Towards smart-pixel-based implementation of wideband active sonar echolocation system for multi-target detection

Towards solving the Table Maker’s Dilemma on GPU Download

Towards Studying the Effect of Compiler Optimizations and Software Randomization on GPU Reliability Download

Towards systematic exploration of tradeoffs for medical image registration on heterogeneous platforms Download

Towards Understanding and Mitigating Memory-Access Challenges in Computing Systems Download Package

Towards Unified Analysis of GPU Consistency Download Package

Towards Unified INT8 Training for Convolutional Neural Network Download

Towards user transparent parallel multimedia computing on GPU-clusters Download

Towards Utilizing GPUs in Information Visualization: A Model and Implementation of Image-Space Operations Download

Towards Utilizing Remote GPUs for CUDA Program Execution Download

TPU-KNN: K Nearest Neighbor Search at Peak FLOP/s Download

Track finding in ATLAS using GPUs Download

Tracking 3d Pose of Rigid Object by Sparse Template Matching

Tracking and Clustering Salient Features in Image Sequences Download

Tracking humans interacting with the environment using efficient hierarchical sampling and layered observation models Download

Tracking Many Solution Paths of a Polynomial Homotopy on a Graphics Processing Unit Download

Tradeoff analysis and optimization of power delivery networks with on-chip voltage regulation Download

Tradeoffs in designing accelerator architectures for visual computing Download

Trainable Nonlinear Reaction Diffusion: A Flexible Framework for Fast and Effective Image Restoration Download Package

Training a Feedback Loop for Hand Pose Estimation Download Package

Training a Vision Transformer from scratch in less than 24 hours with 1 GPU Download

Training DNN Models over Heterogeneous Clusters with Optimal Performance Download

Training Logistic Regression and SVM on 200GB Data Using b-Bit Minwise Hashing and Comparisons with Vowpal Wabbit (VW) Download

 

Brief statistics for this page

Titles: 100

Titles with an open PDF (Download): 93

Titles with a package (Package): 18

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hgpu.org