1173

Papers on hgpu.org (.txt-file)

Track finding in ATLAS using GPUs Download

Tracking 3d Pose of Rigid Object by Sparse Template Matching

Tracking and Clustering Salient Features in Image Sequences Download

Tracking humans interacting with the environment using efficient hierarchical sampling and layered observation models Download

Tracking Many Solution Paths of a Polynomial Homotopy on a Graphics Processing Unit Download

Tradeoff analysis and optimization of power delivery networks with on-chip voltage regulation Download

Tradeoffs in designing accelerator architectures for visual computing Download

Trainable Nonlinear Reaction Diffusion: A Flexible Framework for Fast and Effective Image Restoration Download Package

Training a Feedback Loop for Hand Pose Estimation Download Package

Training a Vision Transformer from scratch in less than 24 hours with 1 GPU Download

Training DNN Models over Heterogeneous Clusters with Optimal Performance Download

Training Logistic Regression and SVM on 200GB Data Using b-Bit Minwise Hashing and Comparisons with Vowpal Wabbit (VW) Download

Training Neural Networks Without Gradients: A Scalable ADMM Approach Download

Tranformation of CPU-based Applications To Leverage on Graphics Processors using CUDA Download

TransAxx: Efficient Transformers with Approximate Computing Download Package

TransCAIP: A Live 3D TV System Using a Camera Array and an Integral Photography Display with Interactive Control of Viewing Parameters Download

TransCL: An Automatic CUDA-to-OpenCL Programs Transformation Framework Download Package

Transfer Time Reduction of Data Transfers between CPU and GPU Download

Transform Coding for Hardware-accelerated Volume Rendering Download

Transformation of Scientific Algorithms to Parallel Computing Code: Single GPU and MPI multi GPU Backends with Subdomain Support Download

Transformations of High-Level Synthesis Codes for High-Performance Computing Download

Transforming and Optimizing Irregular Applications for Parallel Architectures Download

Transforming C OpenMP Programs for Verification in CIVL Download

Translating GPU binaries to tiered SIMD architectures with Ocelot Download

Translating OpenMP Device Constructs to OpenCL using Unnecessary Data Transfer Elimination Download

Translation-invariant two-dimensional discrete wavelet transform on graphics processing units Download

Transparent Acceleration for Heterogeneous Platforms With Compilation to OpenCL Download

Transparent Acceleration of Java-based Deep Learning Engines Download Package

Transparent Accelerator Migration in a Virtualized GPU Environment Download

Transparent Checkpoint-Restart for Hardware-Accelerated 3D Graphics Download

Transparent Checkpointing for OpenGL Applications on GPUs Download

Transparent Compiler and Runtime Specializations for Accelerating Managed Languages on FPGAs Download Package

Transparent CPU-GPU Collaboration for Data-Parallel Kernels on Heterogeneous Systems Download

Transparent FPGA Acceleration with TensorFlow Download Package

Transparent use of Java objects on the GPU in the JaMP/OpenMP framework Download

Trapping of giant-planet cores – I. vortex aided trapping at the outer dead zone edge Download Package

Tree Structured Analysis on GPU Power Study Download

Treecode and fast multipole method for N-body simulation with CUDA Download Package

TREES: A CPU/GPU Task-Parallel Runtime with Explicit Epoch Synchronization Download

Trellis: Portability Across Architectures with a High-level Framework Download

Tri-Hybrid Computational Fluid Dynamics on DOE’s Cray XK7, Titan Download

Triangular matrix inversion on Graphics Processing Unit

Triangular mesh simplification on the GPU Download

Tridiagonalization of a dense symmetric matrix on multiple GPUs and its application to symmetric eigenvalue problems Download

Trie Compression for GPU Accelerated Multi-Pattern Matching Download

TrimZero: A Torch Recurrent Module for Efficient Natural Language Processing Download Package

triSYCL for Xilinx FPGA Download Package

Triton-Sanitizer: A Fast and Device-Agnostic Memory Sanitizer for Triton with Rich Diagnostic Context Download Package

Triton: An Intermediate Language and Compiler for Tiled Neural Network Computations Download Package

TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators Download Package

tritonBLAS: Triton-based Analytical Approach for GEMM Kernel Parameter Selection Download Package

TritonForge: Profiling-Guided Framework for Automated Triton Kernel Optimization Download Package

True 4-Bit Quantized Convolutional Neural Network Training on CPU: Achieving Full-Precision Parity Download Package

True 4D Image Denoising on the GPU Download

TRUST: the HPC open-source CFD platform – from CPU to GPU Download Package

TTC: A Tensor Transposition Compiler for Multiple Architectures Download Package

TuCCompi: A Multi-Layer Programing Model for Heterogeneous Systems with Auto-Tuning Capabilities Download

Tuned and asynchronous stencil kernels for CPU/GPU systems (thesis) Download

Tuned and GPU-accelerated parallel data mining from comparable corpora Download

Tuned and wildly asynchronous stencil kernels for hybrid CPU/GPU systems Download

Tuning a Finite Difference Computation for Parallel Vector Processors Download

Tuning A Hybrid GPU-CPU V-cycle Multilevel Preconditioner for Solving Large Real and Complex Systems of FEM Equations

Tuning Manifold Harmonics Filters Download

Tuning Stencil Codes in OpenCL for FPGAs Download Package

Tuning Streamed Applications on Intel Xeon Phi: A Machine Learning Based Approach Download Package

Turbo Bayesian Compressed Sensing Download

Tutorial 3: Methodologies and Performance Impacts of General Purpose Computing on GPUs

Tutoring LLM into a Better CUDA Optimizer Download Package

TVM: An Automated End-to-End Optimizing Compiler for Deep Learning Download Package

TVM: End-to-End Optimization Stack for Deep Learning Download Package

Twin peaks: a software platform for heterogeneous computing on general-purpose and graphics processors

Two Algorithms for Sorting On Heterogeneous Clusters Download

Two Approaches to Particle Simulation: OpenMPI and CUDA Download

Two improved GPU acceleration strategies for force-directed graph layout

Two Level Approach to Efficient Visualization of Protein Dynamics Download

Two Simple Single-pass GPU methods for Multi-channel Surface Voxelization of Dynamic Scenes Download

Two Stage Data Mining Technique for Fast Monsoon Onset Prediction Download

Two-electron integral evaluation on the graphics processor unit Download

Two-fluid compressible simulations on GPU cluster Download

Two-Level Approach to Efficient Visualization of Protein Dynamics

Two-stage compression for fast volume rendering of time-varying scalar data Download

Two-way partitioning of a recursive Gaussian filter in CUDA Download

Two-Way Real Time Fluid Simulation Using a Heterogeneous Multicore CPU and GPU Architecture

TWQCD’s dynamical DWF project Download

Type-safe Runtime Code Generation: Accelerate to LLVM Download Package

U-Net: Convolutional Networks for Biomedical Image Segmentation Download Package

UAV Path Planning with Parallel Genetic Algorithms on CUDA Architecture Download

uBench: Performance Impact of CUDA Block Geometry Download

UberFlow: a GPU-based particle engine Download

Ubiquitous Parallel Computing from Berkeley, Illinois, and Stanford Download

UCHPC – UnConventional High Performance Computing for Finite Element Simulations Download

Ultra-Fast Detection of Higher-Order Epistatic Interactions on GPUs Download

Ultra-Fast Displaying Spectral Domain Optical Doppler Tomography System Using a Graphics Processing Unit Download

Ultra-fast FFT protein docking on graphics processors Download Package

Ultra-Fast Hybrid CPU-GPU Multiple Scatter Simulation for 3D PET Download

Ultra-fast treatment plan optimization for volumetric modulated arc therapy (VMAT) Download

Ultra-low latency recurrent neural network inference on FPGAs for physics applications with hls4ml Download Package

Ultrasound goes GPU: real-time simulation using CUDA Download

Ultrasound Image Simulation with GPU-based Ray Tracing Download

Uncertainty-Aware Guided Volume Segmentation Download

 

Brief statistics for this page

Titles: 100

Download open PDFs: 92

Package packages: 28

* * *

* * *

HGPU group © 2010-2026 hgpu.org

All rights belong to the respective authors

Contact us: