Papers on hgpu.org (.txt-file)
Uniform partitioning of Monte Carlo radiosity on GPUs
Unifying stream based and reconfigurable computing to design application accelerators

Unleashing the Power of Distributed CPU/GPU Architectures: Massive Astronomical Data Analysis and Visualization case study

Unlocking Bandwidth for GPUs in CC-NUMA Systems

Unsafe Floating-point to Unsigned Integer Casting Check for GPU Programs

Unstructured grid applications on GPU: performance analysis and improvement

Unsupervised Asset Cluster Analysis Implemented with Parallel Genetic Algorithms on the NVIDIA CUDA Platform

Unsupervised Deep Learning of Incompressible Fluid Dynamics

Unsupervised Markovian Segmentation on Graphics Hardware

Up to 700k GPU cores, Kepler, and the Exascale future for simulations of star clusters around black holes

UPC on MIC: Early Experiences with Native and Symmetric Modes

Urban Regional Seismic Damage Prediction Based On GPU-CPU Hybrid Computing

Usable assembly language for GPUs: a success story

Use NVIDIA CUDA technology to create genetic algorithms with extensive population

Use of Checkpoint-Restart for Complex HEP Software on Traditional Architectures and Intel MIC

Use of CUDA for the Continuous Space Language Model

Use of CUDA Parallel Computing Technology in Modeling of Solid Mineral Deposits

Use of FPGA or GPU-based architectures for remotely sensed hyperspectral image processing

Use of modern GPUs in Design Optimization

Use of Multi-GPU Systems for Larger Than Device FFTs: With Applications in Ultrasound Simulations

Use of Multiple GPUs on Shared Memory Multiprocessors for Ultrasound Propagation Simulations

Use of Multiple GPUs to Speedup the Execution of a Three-Dimensional Computational Model of the Innate Immune System

User-Driven Online Kernel Fusion for SYCL

User’s needs influencing HPC technologies

Uses of GPU Powered Interval Optimization for Parameter Identification in the Context of SO Fuel Cells

Using a GPU to accelerate die and mold fabrication

Using a GPU-CPU architecture to speed up a GA-based real-time system for trading the stock market
Using a GPU, Online Diarization – Offline Diarization

Using AI libraries for Incompressible Computational Fluid Dynamics

Using an OpenCL Framework to Evaluate Interconnect Implementations on FPGAs

Using Artificial Intelligence in Computational Games

Using Butterfly-Patterned Partial Sums to Optimize GPU Memory Accesses for Drawing from Discrete Distributions

Using Commodity Coprocessors for Host Intrusion Detection

Using Commodity Graphics Hardware for Real-Time Digital Hologram View-Reconstruction

Using common graphics hardware for multi-agent traffic simulation with CUDA

Using Compiler Directives for Performance Portability in Scientific Computing: Kernels from Molecular Simulation

Using Compiler Snippets to Exploit Parallelism on Heterogeneous Hardware: A Java Reduction Case Study

Using Compute Unified Device Architecture (CUDA) in Parallelizing Different Digital Image Processing Techniques

Using CUDA architecture for computer simulations of thermomechanical phenomena

Using CUDA Architecture for the Computer Simulation of the Casting Solidification Process

Using CUDA for Exhaustive Password Recovery

Using CUDA GPU to Accelerate the Ant Colony Optimization Algorithm

Using Data Compression for Increasing Efficiency of Data Transfer Between Main Memory and Intel Xeon Phi Coprocessor or NVidia GPU in Parallel DBMS

Using Deep Convolutional Neural Networks in Monte Carlo Tree Search

Using Deep Reinforcement Learning for Automatic Code Optimization in the MLIR Compiler

Using DRBL to Deploy MPICH2 and CUDA on Green Computing

Using efficient parallelization in Graphic Processing Units to parameterize stochastic fire propagation models

Using Fermi architecture knowledge to speed up CUDA and OpenCL programs

Using generalized ensemble simulations and Markov state models to identify conformational states

Using GPU for query of email spam detection systems and IDS

Using GPU shaders for visualization
Using GPU Shaders for Visualization, Part 2
Using GPU Simulation to Accurately Fit to the Power-Law Distribution

Using GPU to Accelerate Cache Simulation
Using GPU to exploit parallelism on cryptography
Using GPU VSIPL & CUDA to Accelerate RF Clutter Simulation

Using GPU-based Computing To Accelerate Finite Element Problems

Using GPUs for beamforming acceleration on SAFT imaging
Using GPUs for Machine Learning Algorithms
Using GPUs for Realtime Prediction of Optical Forces on Microsphere Ensembles

Using GPUs to Accelerate Installed Antenna Performance Simulations

Using GPUs to Crack Android Pattern-based Passwords

Using GPUs to Improve Multigrid Solver Performance on a Cluster

Using Graph Properties to Speed-up GPU-based Graph Traversal: A Model-driven Approach

Using Graphic Processing Unit in Block Cipher Calculations (thesis)

Using Graphic Processor Units for the Study of Electric Propagation in Realistic Heart Models

Using Graphical Processing Units for Deterministic Single Machine Scheduling Problems

Using Graphical Processing Units in Scheduling Problems

Using graphics devices in reverse: GPU-based Image Processing and Computer Vision

Using Graphics Hardware for Enhancing Edge and Circle Detection

Using Graphics Processing Unit to Accelerate Database Query Execution

Using Graphics Processing Units for Logic Simulation of Electronic Designs

Using graphics processing units to generate random numbers

Using Graphics Processing Units to Parallelize the FDK Algorithm for Tomographic Image Reconstruction

Using Graphics Processing Units to solve the classical N-body problem in physics and astrophysics

Using Graphics Processor Units (GPUs) for Automatic Video Structuring

Using Graphics Processors for a High Performance Normalization of Gene Expressions

Using graphics processors for high performance IR query processing

Using Graphics Processors for High-Performance Computation and Visualization of Plasma Turbulence
Using Graphics Processors for Parallelizing Hash-based Data Carving

Using Graphics Processors to Accelerate Synthetic Aperture Sonar Imaging via Backpropagation

Using graphics processors to accelerate the computation of the matrix inverse
Using Graphics Processors to Accelerate the Solution of Out-of-Core Linear Systems

Using hardware performance counters to speed up autotuning convergence on GPUs

Using high performance computing and Monte Carlo simulation for pricing american options

Using High Performance Computing for Optimizing Credit Risk Calculation

Using High Performance Computing to Improve Image Guided Cancer Treatment

Using Hybrid CPU-GPU Platforms to Accelerate the Computation of the Matrix Sign Function

Using hybrid GPU/CPU kernel splitting to accelerate spherical convolutions

Using Hybrid Shared and Distributed Caching for Mixed-Coherency GPU Workloads

Using Image Morphing for Memory-Efficient Impostor Rendering on GPU

Using Intel oneAPI for Multi-hybrid Acceleration Programming with GPU and FPGA Coupling

Using JavaScript and WebCL for Numerical Computations: A Comparative Study of Native and Web Technologies

Using Machine Learning to Estimate Utilization and Throughput for OpenCL-Based SpMV Implementation on an FPGA

Using many-core hardware to correlate radio astronomy signals

Using Meta-heuristics and Machine Learning for Software Optimization of Parallel Computing Systems: A Systematic Literature Review

Using Mixed Precision for Sparse Matrix Computations to Enhance the Performance while Achieving 64-bit Accuracy

Using mobile GPU for general-purpose computing – a case study of face recognition on smartphones

Titles: 100
open PDFs: 89
packages: 11
