Papers on hgpu.org (.txt-file)
Understanding the ISA impact on GPU Architecture

Understanding the Landscape of Ampere GPU Memory Errors

Understanding the Performance of HPC Applications

Understanding the Power of Evolutionary Computation for GPU Code Optimization

Understanding the SIMD Efficiency of Graph Traversal on GPU

Understanding the Topics and Challenges of GPU Programming by Classifying and Analyzing Stack Overflow Posts

Unfolding and Shrinking Neural Machine Translation Ensembles

UNICORN: A Bulk Synchronous Programming Model, Framework and Runtime for Hybrid CPU-GPU Clusters

Unified – A Sharp Turn in the Latest Era of Graphic Processors

Unified Deep Learning with CPU, GPU, and FPGA Technologies

Unified Development for Mixed Multi-GPU and Multi-Coprocessor Environments using a Lightweight Runtime Environment

Unified Parallel C for GPU Clusters: Language Extensions and Compiler Implementation

Unified Particle Physics for Real-Time Applications

Unified schemes for directive-based GPU offloading

Unified Shader Programming in C++

Unified Shared Memory: Friend or Foe?

Unified system of code transformation and execution for heterogeneous multi-core architectures

Unified Tables for Exponential and Logarithm Families

UniFL: Accelerating Federated Learning Using Heterogeneous Hardware Under a Unified Framework

Uniform partitioning of Monte Carlo radiosity on GPUs
Unifying stream based and reconfigurable computing to design application accelerators

Unleashing the Power of Distributed CPU/GPU Architectures: Massive Astronomical Data Analysis and Visualization case study

Unlocking Bandwidth for GPUs in CC-NUMA Systems

Unsafe Floating-point to Unsigned Integer Casting Check for GPU Programs

Unstructured grid applications on GPU: performance analysis and improvement

Unsupervised Asset Cluster Analysis Implemented with Parallel Genetic Algorithms on the NVIDIA CUDA Platform

Unsupervised Deep Learning of Incompressible Fluid Dynamics

Unsupervised Markovian Segmentation on Graphics Hardware

Up to 700k GPU cores, Kepler, and the Exascale future for simulations of star clusters around black holes

UPC on MIC: Early Experiences with Native and Symmetric Modes

Urban Regional Seismic Damage Prediction Based On GPU-CPU Hybrid Computing

Usable assembly language for GPUs: a success story

Use NVIDIA CUDA technology to create genetic algorithms with extensive population

Use of Checkpoint-Restart for Complex HEP Software on Traditional Architectures and Intel MIC

Use of CUDA for the Continuous Space Language Model

Use of CUDA Parallel Computing Technology in Modeling of Solid Mineral Deposits

Use of FPGA or GPU-based architectures for remotely sensed hyperspectral image processing

Use of modern GPUs in Design Optimization

Use of Multi-GPU Systems for Larger Than Device FFTs: With Applications in Ultrasound Simulations

Use of Multiple GPUs on Shared Memory Multiprocessors for Ultrasound Propagation Simulations

Use of Multiple GPUs to Speedup the Execution of a Three-Dimensional Computational Model of the Innate Immune System

User-Driven Online Kernel Fusion for SYCL

User’s needs influencing HPC technologies

Uses of GPU Powered Interval Optimization for Parameter Identification in the Context of SO Fuel Cells

Using a GPU to accelerate die and mold fabrication

Using a GPU-CPU architecture to speed up a GA-based real-time system for trading the stock market
Using a GPU, Online Diarization – Offline Diarization

Using AI libraries for Incompressible Computational Fluid Dynamics

Using an OpenCL Framework to Evaluate Interconnect Implementations on FPGAs

Using Artificial Intelligence in Computational Games

Using Butterfly-Patterned Partial Sums to Optimize GPU Memory Accesses for Drawing from Discrete Distributions

Using Commodity Coprocessors for Host Intrusion Detection

Using Commodity Graphics Hardware for Real-Time Digital Hologram View-Reconstruction

Using common graphics hardware for multi-agent traffic simulation with CUDA

Using Compiler Directives for Performance Portability in Scientific Computing: Kernels from Molecular Simulation

Using Compiler Snippets to Exploit Parallelism on Heterogeneous Hardware: A Java Reduction Case Study

Using Compute Unified Device Architecture (CUDA) in Parallelizing Different Digital Image Processing Techniques

Using CUDA architecture for computer simulations of thermomechanical phenomena

Using CUDA Architecture for the Computer Simulation of the Casting Solidification Process

Using CUDA for Exhaustive Password Recovery

Using CUDA GPU to Accelerate the Ant Colony Optimization Algorithm

Using Data Compression for Increasing Efficiency of Data Transfer Between Main Memory and Intel Xeon Phi Coprocessor or NVidia GPU in Parallel DBMS

Using Deep Convolutional Neural Networks in Monte Carlo Tree Search

Using Deep Reinforcement Learning for Automatic Code Optimization in the MLIR Compiler

Using DRBL to Deploy MPICH2 and CUDA on Green Computing

Using efficient parallelization in Graphic Processing Units to parameterize stochastic fire propagation models

Using Fermi architecture knowledge to speed up CUDA and OpenCL programs

Using generalized ensemble simulations and Markov state models to identify conformational states

Using GPU for query of email spam detection systems and IDS

Using GPU shaders for visualization
Using GPU Shaders for Visualization, Part 2
Using GPU Simulation to Accurately Fit to the Power-Law Distribution

Using GPU to Accelerate Cache Simulation
Using GPU to exploit parallelism on cryptography
Using GPU VSIPL & CUDA to Accelerate RF Clutter Simulation

Using GPU-based Computing To Accelerate Finite Element Problems

Using GPUs for beamforming acceleration on SAFT imaging
Using GPUs for Machine Learning Algorithms
Using GPUs for Realtime Prediction of Optical Forces on Microsphere Ensembles

Using GPUs to Accelerate Installed Antenna Performance Simulations

Using GPUs to Crack Android Pattern-based Passwords

Using GPUs to Improve Multigrid Solver Performance on a Cluster

Using Graph Properties to Speed-up GPU-based Graph Traversal: A Model-driven Approach

Using Graphic Processing Unit in Block Cipher Calculations (thesis)

Using Graphic Processor Units for the Study of Electric Propagation in Realistic Heart Models

Using Graphical Processing Units for Deterministic Single Machine Scheduling Problems

Using Graphical Processing Units in Scheduling Problems

Using graphics devices in reverse: GPU-based Image Processing and Computer Vision

Using Graphics Hardware for Enhancing Edge and Circle Detection

Using Graphics Processing Unit to Accelerate Database Query Execution

Using Graphics Processing Units for Logic Simulation of Electronic Designs

Using graphics processing units to generate random numbers

Using Graphics Processing Units to Parallelize the FDK Algorithm for Tomographic Image Reconstruction

Using Graphics Processing Units to solve the classical N-body problem in physics and astrophysics

Using Graphics Processor Units (GPUs) for Automatic Video Structuring

Using Graphics Processors for a High Performance Normalization of Gene Expressions

Using graphics processors for high performance IR query processing

Using Graphics Processors for High-Performance Computation and Visualization of Plasma Turbulence
Using Graphics Processors for Parallelizing Hash-based Data Carving

Titles: 100
open PDFs: 91
packages: 13
