1173

Papers on hgpu.org (.txt-file)

TrimZero: A Torch Recurrent Module for Efficient Natural Language Processing Download Package

triSYCL for Xilinx FPGA Download Package

Triton: An Intermediate Language and Compiler for Tiled Neural Network Computations Download Package

True 4D Image Denoising on the GPU Download

TTC: A Tensor Transposition Compiler for Multiple Architectures Download Package

TuCCompi: A Multi-Layer Programing Model for Heterogeneous Systems with Auto-Tuning Capabilities Download

Tuned and asynchronous stencil kernels for CPU/GPU systems (thesis) Download

Tuned and GPU-accelerated parallel data mining from comparable corpora Download

Tuned and wildly asynchronous stencil kernels for hybrid CPU/GPU systems Download

Tuning a Finite Difference Computation for Parallel Vector Processors Download

Tuning A Hybrid GPU-CPU V-cycle Multilevel Preconditioner for Solving Large Real and Complex Systems of FEM Equations

Tuning Manifold Harmonics Filters Download

Tuning Stencil Codes in OpenCL for FPGAs Download Package

Tuning Streamed Applications on Intel Xeon Phi: A Machine Learning Based Approach Download Package

Turbo Bayesian Compressed Sensing Download

Tutorial 3: Methodologies and Performance Impacts of General Purpose Computing on GPUs

TVM: An Automated End-to-End Optimizing Compiler for Deep Learning Download Package

TVM: End-to-End Optimization Stack for Deep Learning Download Package

Twin peaks: a software platform for heterogeneous computing on general-purpose and graphics processors

Two Algorithms for Sorting On Heterogeneous Clusters Download

Two Approaches to Particle Simulation: OpenMPI and CUDA Download

Two improved GPU acceleration strategies for force-directed graph layout

Two Level Approach to Efficient Visualization of Protein Dynamics Download

Two Simple Single-pass GPU methods for Multi-channel Surface Voxelization of Dynamic Scenes Download

Two Stage Data Mining Technique for Fast Monsoon Onset Prediction Download

Two-electron integral evaluation on the graphics processor unit Download

Two-fluid compressible simulations on GPU cluster Download

Two-Level Approach to Efficient Visualization of Protein Dynamics

Two-stage compression for fast volume rendering of time-varying scalar data Download

Two-way partitioning of a recursive Gaussian filter in CUDA Download

Two-Way Real Time Fluid Simulation Using a Heterogeneous Multicore CPU and GPU Architecture

TWQCD’s dynamical DWF project Download

Type-safe Runtime Code Generation: Accelerate to LLVM Download Package

U-Net: Convolutional Networks for Biomedical Image Segmentation Download Package

UAV Path Planning with Parallel Genetic Algorithms on CUDA Architecture Download

uBench: Performance Impact of CUDA Block Geometry Download

UberFlow: a GPU-based particle engine Download

Ubiquitous Parallel Computing from Berkeley, Illinois, and Stanford Download

UCHPC – UnConventional High Performance Computing for Finite Element Simulations Download

Ultra-Fast Detection of Higher-Order Epistatic Interactions on GPUs Download

Ultra-Fast Displaying Spectral Domain Optical Doppler Tomography System Using a Graphics Processing Unit Download

Ultra-fast FFT protein docking on graphics processors Download Package

Ultra-Fast Hybrid CPU-GPU Multiple Scatter Simulation for 3D PET Download

Ultra-fast treatment plan optimization for volumetric modulated arc therapy (VMAT) Download

Ultra-low latency recurrent neural network inference on FPGAs for physics applications with hls4ml Download Package

Ultrasound goes GPU: real-time simulation using CUDA Download

Ultrasound Image Simulation with GPU-based Ray Tracing Download

Uncertainty-Aware Guided Volume Segmentation Download

Uncluttering Graph Layouts Using Anisotropic Diffusion and Mass Transport Download

Uncontracted Rys Quadrature Implementation of up to G Functions on Graphical Processing Units

Under the Hood of SYCL – An Initial Performance Analysis With an Unstructured-mesh CFD Application Download Package

Understanding and Modeling the Synchronization Cost in the GPU Architecture Download

Understanding Data Movement in AMD Multi-GPU Systems with Infinity Fabric Download

Understanding GPU Programming for Statistical Computation: Studies in Massively Parallel Massive Mixtures Download Package

Understanding GPU Triggering APIs for MPI+X Communication Download

Understanding GPU-Based Lossy Compression for Extreme-Scale Cosmological Simulations Download Package

Understanding Latency Hiding on GPUs Download

Understanding Performance Portability of Bioinformatics Applications in SYCL on an NVIDIA GPU Download Package

Understanding Protein Dynamics with L1-Regularized Reversible Hidden Markov Models Download

Understanding software approaches for GPGPU reliability Download

Understanding the Costs of Many-Task Computing Workloads on Intel Xeon Phi Coprocessors Download

Understanding the design trade-offs among current multicore systems for numerical computations Download

Understanding the efficiency of GPU algorithms for matrix-matrix multiplication Download

Understanding the efficiency of parallel incomplete Cholesky preconditioners on the performance of ICCG solvers for multi-core and GPU systems

Understanding the efficiency of ray traversal on GPUs Download

Understanding the impact of CUDA tuning techniques for Fermi Download

Understanding the Impact of Hybrid Programming on Software Energy Efficiency Download

Understanding the Impact of Input Entropy on FPU, CPU, and GPU Power Download Package

Understanding the ISA impact on GPU Architecture Download

Understanding the Performance of HPC Applications Download

Understanding the Power of Evolutionary Computation for GPU Code Optimization Download Package

Understanding the SIMD Efficiency of Graph Traversal on GPU Download

Understanding the Topics and Challenges of GPU Programming by Classifying and Analyzing Stack Overflow Posts Download

Unfolding and Shrinking Neural Machine Translation Ensembles Download Package

UNICORN: A Bulk Synchronous Programming Model, Framework and Runtime for Hybrid CPU-GPU Clusters Download Package

Unified – A Sharp Turn in the Latest Era of Graphic Processors Download

Unified Deep Learning with CPU, GPU, and FPGA Technologies Download

Unified Development for Mixed Multi-GPU and Multi-Coprocessor Environments using a Lightweight Runtime Environment Download

Unified Parallel C for GPU Clusters: Language Extensions and Compiler Implementation Download

Unified Particle Physics for Real-Time Applications Download

Unified Shader Programming in C++ Download

Unified Shared Memory: Friend or Foe? Download

Unified system of code transformation and execution for heterogeneous multi-core architectures Download

Unified Tables for Exponential and Logarithm Families Download

UniFL: Accelerating Federated Learning Using Heterogeneous Hardware Under a Unified Framework Download

Uniform partitioning of Monte Carlo radiosity on GPUs

Unifying stream based and reconfigurable computing to design application accelerators Download

Unleashing the Power of Distributed CPU/GPU Architectures: Massive Astronomical Data Analysis and Visualization case study Download

Unlocking Bandwidth for GPUs in CC-NUMA Systems Download

Unsafe Floating-point to Unsigned Integer Casting Check for GPU Programs Download

Unstructured grid applications on GPU: performance analysis and improvement Download

Unsupervised Asset Cluster Analysis Implemented with Parallel Genetic Algorithms on the NVIDIA CUDA Platform Download

Unsupervised Deep Learning of Incompressible Fluid Dynamics Download

Unsupervised Markovian Segmentation on Graphics Hardware Download

Up to 700k GPU cores, Kepler, and the Exascale future for simulations of star clusters around black holes Download Package

UPC on MIC: Early Experiences with Native and Symmetric Modes Download

Urban Regional Seismic Damage Prediction Based On GPU-CPU Hybrid Computing Download

Usable assembly language for GPUs: a success story Download

Usage of GPU in LS-DYNA Download

Use NVIDIA CUDA technology to create genetic algorithms with extensive population Download

 

Brief statistics for this page

Titles: 100

Download open PDFs: 91

Package packages: 21

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: