Papers on hgpu.org (.txt-file)
Generating Efficient Tensor Contractions for GPUs

Generating GPU Code from a High-level Representation for Image Processing Kernels

Generating GPU Compiler Heuristics using Reinforcement Learning

Generating Literature-Driven Scientific Theories at Scale

Generating massive high-quality random numbers using GPU

Generating Null Models for Large-Scale Networks on GPU

Generating optimal CUDA sparse matrix-vector product implementations for evolving GPU hardware

Generating Parallel OpenCL and OpenMP Programs from Dataflow Graphs

Generating Performance Portable Code using Rewrite Rules: From High-level Functional Expressions to High-Performance OpenCL Code

Generating SU(Nc) pure gauge lattice QCD configurations on GPUs with CUDA and OpenMP

Generating subdivision curves with L-systems on a GPU

Generating textures on Surfaces with Reaction-Diffusion systems in the GPU

Generating, Optimizing, and Scheduling a Compiler Level Representation of Stream Parallelism

Generation of Kernels for Calculating Electron Repulsion Integrals of High Angular Momentum Functions on GPUs – Preliminary Results

Generation of planar radiographs from 3D anatomical models using the GPU

Generation of Random Numbers on Graphics Processors: Forced Indentation In Silico of the Bacteriophage HK97

Generation of the Scrambled Halton Sequence Using Accelerators

Generative programming methods for parallel partial differential field equation solvers

Generative Video Compression: Towards 0.01% Compression Rate for Video Transmission

Generic Inverted Index on the GPU

Genetic Algorithm Modeling with GPU Parallel Computing Technology

Genetic Improvement of GPU Software

Genetic Programming An Introductory Tutorial and a Survey of Techniques and Applications

Genetic programming on GPUs for image processing

Genetic programming on graphics processing units

Genetic Programming using the Karva Gene Expression Language on Graphical Processing Units

Genetically Improved BarraCUDA

Genetically Improved CUDA C++ Software

Genetically Improved CUDA kernels for StereoCamera

GenGNN: A Generic FPGA Framework for Graph Neural Network Acceleration

GENIE: a software package for gene-gene interaction analysis in genetic association studies using multiple GPU or CPU cores

GeNN: a code generation framework for accelerated brain simulations

Genomics-GPU: A Benchmark Suite for GPU-accelerated Genome Analysis

GenVectorX: A performance-portable SYCL library for Lorentz Vectors operations

Geo-Correction of High-Resolution Imagery Using Fast Template Matching on a GPU in Emergency Mapping Contexts

Geodesic tree-based dynamic programming for fast stereo reconstruction

Geometric Algebra Computing Technology for Accelerated Processing Units

Geometric Algebra enhanced Precompiler for C++ and OpenCL

Geometric Algebra Enhanced Precompiler for C++, OpenCL and Mathematica’s OpenCLLink

Geometric Optimisation using Karva for Graphical Processing Units

Geometry Based Visualization with OpenCL

Geometry Construction from Caustic Images

Geometry Textures and Applications

Geospatial visualization using hardware accelerated real-time volume rendering

Gerbil: A Fast and Memory-Efficient k-mer Counter with GPU-Support

Getting Started with GPU Programming
GEVO-ML: Optimizing Machine Learning Code with Evolutionary Computation

GEVO: GPU Code Optimization using Evolutionary Computation

GGArray: A Dynamically Growable GPU Array

GGAS: Global GPU Address Spaces for Efficient Communication in Heterogeneous Clusters

GGNN: Graph-based GPU Nearest Neighbor Search

GHOST: Building blocks for high performance sparse linear algebra on heterogeneous systems

GHOST: GPGPU-Offloaded High Performance Storage I/O Deduplication for Primary Storage System

GHOSTM: A GPU-Accelerated Homology Search Tool for Metagenomics

GIFT: A Real-time and Scalable 3D Shape Search Engine

GigaAPI for GPU Parallelization

GiMMiK – Generating Bespoke Matrix Multiplication Kernels for Various Hardware Accelerators; Applications in High-Order Computational Fluid Dynamics

Ginkgo – A Math Library designed for Platform Portability

Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing

GIS Polygon Overlay Processing: New Parallel Algorithm and System Prototype

GiST Scan Acceleration using Coprocessors

GIST: an interactive, GPU-based level set segmentation tool for 3D medical images

GKLEE: Concolic Verification and Test Generation for GPUs

GL4D: A GPU-based Architecture for Interactive 4D Visualization
gLBM: A GPU enabled Lattice Boltzmann Method Library

Glider: A GPU Library Driver for Improved System Security

Glift: Generic, efficient, random-access GPU data structures

Global Depth from Epipolar Volumes – A General Framework for Reconstructing Non-Lambertian Surfaces

Global finite element matrix construction based on a CPU-GPU implementation

Global Illumination for Advanced Computer Graphics
Global Illumination for Interactive Lighting Design Using Light Path Pre-Computation and Hierarchical Histogram Estimation

Global memory access modelling for efficient implementation of the lattice Boltzmann method on graphics processing units

Global optimization model on power efficiency of GPU and multicore processing element for SIMD computing with CUDA

Global Point Mascon Models for Simple, Accurate and Parallel Geopotential Computation

Globally scheduled real-time multiprocessor systems with GPUs

GLoP: Enabling Massively Parallel Incident Response Through GPU Log Processing

GLOpenCL: OpenCL support on hardware- and software-managed cache multicores
Glow: Graph Lowering Compiler Techniques for Neural Networks

GLSV: Graphics library stereo vision for OpenGL
GLU3.0: Fast GPU-based Parallel Sparse LU Factorization for Circuit Simulation

GMH: A Message Passing Toolkit for GPU Clusters

GMM based Fisher vector calculation on GPGPU

GMP implementation on CUDA – A Backward Compatible Design With Performance Tuning

GMProf: A Low-Overhead, Fine-Grained Profiling Approach for GPU Programs

gNek: A GPU Accelerated Incompressible Navier Stokes Solver

Go game move prediction using convolutional neural network

Going Deeper with Embedded FPGA Platform for Convolutional Neural Network

Going green: optimizing GPUs for energy efficiency through model-steered auto-tuning

Good things come in small packages: Should we adopt Lite-GPUs in AI infrastructure?

GooFit: A library for massively parallelising maximum-likelihood fits

Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation

GOST-28147 Encryption Implementation on Graphics Processing Units
GOTHIC: Gravitational oct-tree code accelerated by hierarchical time step controlling

GP on SPMD parallel graphics hardware for mega Bioinformatics data mining

GP-GPU: Bridging the Gap between Modelling & Experimentation
GPA: A GPU Performance Advisor Based on Instruction Sampling

GPApriori: GPU-Accelerated Frequent Itemset Mining

Titles: 100
open PDFs: 93
packages: 33
