Papers on hgpu.org (.txt-file)
Interactive Simulations with Navier-Stokes Equations on many-core Architectures

Interactive Soft Tissue for Surgical Simulation

Interactive soft-fabrics watering simulation on GPU
Interactive SPH Simulation and Rendering on the GPU

Interactive Streak Surface Visualization on the GPU

Interactive transparency rendering for large CAD models
Interactive Two-sided Refraction for Dynamic Object on GPU
Interactive visibility culling in complex environments using occlusion-switches

Interactive visualisation of spins and clusters in regular and small-world Ising models with CUDA on GPUs

Interactive Visualization of Molecular Surface Dynamics

Interactive visualization of streaming data with Kernel Density Estimation

Interactive Visualization of the Largest Radioastronomy Cubes

Interactive Visualization of Volumetric White Matter Connectivity in DT-MRI Using a Parallel-Hardware Hamilton-Jacobi Solver

Interactive volume illustration

Interactive Volume Rendering Aurora on the GPU

Interactive Volume Rendering of Functional Representations in Quantum Chemistry

Interactive volumetric lighting simulating scattering and shadowing

Interactive water streams with sphere scan conversion

Interactive, GPU-Based Level Sets for 3D Segmentation

Interactively Rendering Dynamic Caustics on GPU

Interactively Simulating Fluid based on SPH and CUDA

Interconnect Bandwidth Heterogeneity on AMD MI250x and Infinity Fabric

Interective Point Clouds Fairing on Many-Core System
Interference-driven resource management for GPU-based heterogeneous clusters

Interlanguages and synchronic models of computation

Interleaved Learning and Exploration: A Self-Adaptive Fuzz Testing Framework for MLIR

Interleaving and Lock-Step Semantics for Analysis and Verification of GPU Kernels

Intermediate fabrics: virtual architectures for circuit portability and fast placement and routing

Intermediate Language Extensions for Parallelism

Interoperable GPU Kernels as Latency Improver for MEC

InteropUnityCUDA: A Tool for Interoperability Between Unity and CUDA

Interpolated pressure laws in two-fluid simulations and hyperbolicity

Interpolation with Radial Basis Functions on GPGPUs using CUDA

Interpretive OpenGL for computer graphics

Intersecting two families of sets on the GPU

Interventional 4-D Motion Estimation and Reconstruction of Cardiac Vasculature without Motion Periodicity Assumption

Intra-Application Data-Communication Characterization

Intra-node Memory Safe GPU Co-Scheduling

Introducing ‘Bones’: A Parallelizing Source-to-Source Compiler Based on Algorithmic Skeletons

Introducing CURRENNT – the Munich open-source CUDA RecurREnt Neural Network Toolkit

Introducing CURRENNT: The Munich Open-Source CUDA RecurREnt Neural Network Toolkit

Introducing Energy Efficiency into Graphics Processors
Introducing Parallelism to the Ranges TS

Introducing SLAMBench, a performance and accuracy benchmarking methodology for SLAM

Introduction to GPGPU programming
Introduction to GPGPU, a hardware and software background
Introduction to GPU Computing and CUDA Programming: A Case Study on FDTD [EM Programmer’s Notebook]
Introduction to GPU programming for EDA

Introduction to GPU Programming with GLSL

Introduction to GPU Radix Sort

Introduction to the Report “Interlanguages and Synchronic Models of Computation.”

Introduction to the Special Issue on Digital Signal Processing in Radio Astronomy

Intrusion Detection Architecture Utilizing Graphics Processors

Intrusion Detection using Spiking Neural Networks

Inverse scattering and refraction corrected reflection for breast cancer imaging
Investigating Half Precision Arithmetic to Accelerate Dense Linear System Solvers

Investigating Host-Device communication in a GPU-based H.264 encoder

Investigating Input Representations and Representation Models of Source Code for Machine Learning

Investigating performance portability of a highly scalable particle-in-cell simulation code on various multi-core architectures

Investigating performance variations of an optimized GPU-ported granulometry algorithm

Investigating Single Precision Floating General Matrix Multiply in Heterogeneous

Investigating SRAM PUFs in large CPUs and GPUs

Investigating the Impact of Data Parallelism and GPU Technology on Computer Gaming

Investigating the Performance of Motion Estimation Block-Matching Algorithms on GPU Cards

Investigating the use of GPU-accelerated nodes for SAR image formation

Investigating the use of GPUs with a Monte Carlo Astrophysical Simulation

Investigating Warp Size Impact in GPUs

Investigation of General-Purpose Computing on Graphics Processing Units and its Application to the Finite Element Analysis of Electromagnetic Problems

Investigation of GPU-based Pattern Matching

Investigation of heterogeneous computing through novel parallel programming platforms

Investigation of Parallel Computation – MPI, CUDA and Parallel Visualization

Investigation of the OpenCL SYCL Programming Model

Investigation of the SYCL for OpenCL Programming Model

Investigation on the Use of GPGPU for Fast Sparse Matrix Factorization
Invitation to a Standard Programming Interface for Massively Parallel Computing Environment: OpenCL

Invited paper: Accelerating neuromorphic vision on FPGAs

IODA: an Input/Output Deep Architecture for image labeling

IP routing processing with graphic processors

IPMACC: Open Source OpenACC to CUDA/OpenCL Translator

IPMACC: Translating OpenACC API to OpenCL

Iris Matching Algorithm on Many-Core Platforms

Iris recognition on GPU with the usage of Non-Negative Matrix Factorization
Iris: First-Class Multi-GPU Programming Experience in Triton

IRIS: Illustrative Rendering for Integral Surfaces

Irradiation Instability at the Inner Edges of Accretion Disks

Irregular algorithms on the Xeon Phi

Irregularity Mitigation and Portability Abstractions for Accelerated Sparse Matrix Factorization

Is GPGPU CCL worth it? A performance comparison between some GPU and CPU algorithms for solving connected components labeling on binary images

Is OpenCL a suitable platform for algorithm development in health care systems?

Is the game worth the candle? Evaluation of OpenCL for object detection algorithm optimization

Is the GPU Half-Empty or Half-Full? Practical Scheduling Techniques for LLMs

ISM2: Optimizing Irregular-Shaped Matrix-Matrix Multiplication on GPUs

Isocube: Exploiting the Cubemap Hardware

Isolated Scheduling for Distributed Training Tasks in GPU Clusters

Isosurface Extraction and View-Dependent Filtering from Time-Varying Fields Using Persistent Time-Octree (PTOT)

Issues and challenges in compiling for graphics processors

Issues in Heterogenenous GPU Clusters

It’s all about data movement: Optimising FPGA data access to boost performance

Titles: 100
open PDFs: 87
packages: 16
