Papers on hgpu.org (.txt-file)
Gaining Cross-Platform Parallelism for HAL’s Molecular Dynamics Package using SYCL

Gaiwan: a Size-Polymorphic Typesystem for GPU Programs

GALAMOST: GPU-accelerated large-scale molecular simulation toolkit

GALARIO: a GPU Accelerated Library for Analysing Radio Interferometer Observations

Galerkin-based multi-scale time integration for nonlinear structural dynamics

Gallatin: A General-Purpose GPU Memory Manager

Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism

GamePipe: A Virtualized Cloud Platform Design and Performance Evaluation

GAMER with out-of-core computation

GAMER-2: a GPU-accelerated adaptive mesh refinement code — accuracy, performance, and scalability

GAMER: a GPU-Accelerated Adaptive Mesh Refinement Code for Astrophysics

GAMUT: GPU accelerated microRNA analysis to uncover target genes through CUDA-miRanda

GARDENIA: A Domain-specific Benchmark Suite for Next-generation Accelerators

GAROP: Genetic Algorithm framework for Running On Parallel environments

GASPP: A GPU-Accelerated Stateful Packet Processing Framework

Gate-Level Simulation with GPU Computing

Gauge Field Generation on Large-Scale GPU-Enabled Systems

Gauge Fixing in Lattice QCD on GPUs

Gauge fixing in lattice QCD with multi-GPUs

Gauge fixing using overrelaxation and simulated annealing on GPUs

Gaussian Mixture Model Based Volume Visualization

Gaussian Process Models with Parallelization and GPU acceleration

Gaussian split Ewald: A fast Ewald mesh method for molecular simulation

GBOOST: A GPU-based tool for detecting gene-gene interactions in genome-wide case control studies

GBOTuner: Autotuning of OpenMP Parallel Codes with Bayesian Optimization and Code Representation Transfer Learning

GC3: An Optimizing Compiler for GPU Collective Communication

GCN Inference Acceleration using High-Level Synthesis

GCS: High-Performance Gate-Level Simulation with GP-GPUs

GCStack+GCScaler: Fast and Accurate GPU Performance Analyses Using Fine-Grained Stall Cycle Accounting and Interval Analysis

Gdev: First-Class GPU Resource Management in the Operating System

GDlog: A GPU-Accelerated Deductive Engine

GE-SpMM: General-purpose Sparse Matrix-Matrix Multiplication on GPUs for Graph Neural Networks

Geak: Introducing Triton Kernel AI Agent & Evaluation Benchmarks

GeantV: from CPU to accelerators

GEARS: A General and Efficient Algorithm for Rendering Shadows

gearshifft – The FFT Benchmark Suite for Heterogeneous Platforms

GeauxDock: Accelerating Structure-Based Virtual Screening with Heterogeneous Computing

GeePS: Scalable deep learning on distributed GPUs with a GPU-specialized parameter server

gem5-gpu: A Heterogeneous CPU-GPU Simulator

gEMfitter: A Highly Parallel FFT-Based 3D Density Fitting Tool With GPU Texture Memory Acceleration

Gemma in April: A matrix-like parallel programming architecture on OpenCL

GEMMbench: a framework for reproducible and collaborative benchmarking of matrix multiplication

gEMpicker: A Highly Parallel GPU-Accelerated Particle Picking Tool for Cryo-Electron Microscopy

GEMTC: GPU Enabled Many-Task Computing

GenBase: A Complex Analytics Genomics Benchmark

General Purpose Computation on Graphics Processing Units Using OpenCL

General purpose computing on graphics processing units using OpenCL

General Purpose Computing on Low-Power Embedded GPUs: Has It Come of Age?

General purpose lattice QCD code set Bridge++ 2.0 for high performance computing

General purpose molecular dynamics simulations fully implemented on graphics processing units

General purpose Molecular Dynamics Simulations on GPUs: Issues of Pair Forces and Scaling to large Clusters

General Transformations for GPU Execution of Tree Traversals

General-Purpose Computing on Tensor Processors

General-purpose GPU computing: practice and experience

General-purpose molecular dynamics simulations on GPU-based clusters

Generalisation in genetic programming

Generalized Resource Allocation for the Cloud

Generalized Voronoi Diagram Computation on GPU

Generalizing Execution of Vectorizable Computations by Generating Vector Oriented Byte Code

Generalizing the Utility of GPUs in Large-Scale Heterogeneous Computing Systems

Generating 3D Topologies with Multiple Constraints on the GPU

Generating and Rendering Procedural Clouds in Real Time on Programmable 3D Graphics Hardware

Generating Binary Optimal Codes Using Heterogeneous Parallel Computing

Generating Custom Code for Efficient Query Execution on Heterogeneous Processors

Generating Device-specific GPU code for Local Operators in Medical Imaging

Generating Efficient Data Movement Code for Heterogeneous Architectures with Distributed-Memory

Generating Efficient Tensor Contractions for GPUs

Generating GPU Code from a High-level Representation for Image Processing Kernels

Generating GPU Compiler Heuristics using Reinforcement Learning

Generating massive high-quality random numbers using GPU

Generating Null Models for Large-Scale Networks on GPU

Generating optimal CUDA sparse matrix-vector product implementations for evolving GPU hardware

Generating Parallel OpenCL and OpenMP Programs from Dataflow Graphs

Generating Performance Portable Code using Rewrite Rules: From High-level Functional Expressions to High-Performance OpenCL Code

Generating SU(Nc) pure gauge lattice QCD configurations on GPUs with CUDA and OpenMP

Generating subdivision curves with L-systems on a GPU

Generating textures on Surfaces with Reaction-Diffusion systems in the GPU

Generating, Optimizing, and Scheduling a Compiler Level Representation of Stream Parallelism

Generation of Kernels for Calculating Electron Repulsion Integrals of High Angular Momentum Functions on GPUs – Preliminary Results

Generation of planar radiographs from 3D anatomical models using the GPU

Generation of Random Numbers on Graphics Processors: Forced Indentation In Silico of the Bacteriophage HK97

Generation of the Scrambled Halton Sequence Using Accelerators

Generative programming methods for parallel partial differential field equation solvers

Generic Inverted Index on the GPU

Genetic Algorithm Modeling with GPU Parallel Computing Technology

Genetic Improvement of GPU Software

Genetic Programming: An Introductory Tutorial and a Survey of Techniques and Applications

Genetic programming on GPUs for image processing

Genetic programming on graphics processing units

Genetic Programming using the Karva Gene Expression Language on Graphical Processing Units

Genetically Improved BarraCUDA

Genetically Improved CUDA C++ Software

Genetically Improved CUDA kernels for StereoCamera

GenGNN: A Generic FPGA Framework for Graph Neural Network Acceleration

GENIE: a software package for gene-gene interaction analysis in genetic association studies using multiple GPU or CPU cores

GeNN: a code generation framework for accelerated brain simulations

Genomics-GPU: A Benchmark Suite for GPU-accelerated Genome Analysis

GenVectorX: A performance-portable SYCL library for Lorentz Vectors operations

Titles: 100
open PDFs: 98
packages: 35
