Papers on hgpu.org (.txt-file)
Generating and Rendering Procedural Clouds in Real Time on Programmable 3D Graphics Hardware
Generating Binary Optimal Codes Using Heterogeneous Parallel Computing
Generating Custom Code for Efficient Query Execution on Heterogeneous Processors
Generating Device-specific GPU code for Local Operators in Medical Imaging
Generating Efficient Data Movement Code for Heterogeneous Architectures with Distributed-Memory
Generating Efficient Tensor Contractions for GPUs
Generating GPU Code from a High-level Representation for Image Processing Kernels
Generating GPU Compiler Heuristics using Reinforcement Learning
Generating massive high-quality random numbers using GPU
Generating Null Models for Large-Scale Networks on GPU
Generating optimal CUDA sparse matrix-vector product implementations for evolving GPU hardware
Generating Parallel OpenCL and OpenMP Programs from Dataflow Graphs
Generating Performance Portable Code using Rewrite Rules: From High-level Functional Expressions to High-Performance OpenCL Code
Generating SU(Nc) pure gauge lattice QCD configurations on GPUs with CUDA and OpenMP
Generating subdivision curves with L-systems on a GPU
Generating textures on Surfaces with Reaction-Diffusion systems in the GPU
Generating, Optimizing, and Scheduling a Compiler Level Representation of Stream Parallelism
Generation of Kernels for Calculating Electron Repulsion Integrals of High Angular Momentum Functions on GPUs – Preliminary Results
Generation of planar radiographs from 3D anatomical models using the GPU
Generation of Random Numbers on Graphics Processors: Forced Indentation In Silico of the Bacteriophage HK97
Generation of the Scrambled Halton Sequence Using Accelerators
Generative programming methods for parallel partial differential field equation solvers
Generic Inverted Index on the GPU
Genetic Algorithm Modeling with GPU Parallel Computing Technology
Genetic Improvement of GPU Software
Genetic Programming: An Introductory Tutorial and a Survey of Techniques and Applications
Genetic programming on GPUs for image processing
Genetic programming on graphics processing units
Genetic Programming using the Karva Gene Expression Language on Graphical Processing Units
Genetically Improved BarraCUDA
Genetically Improved CUDA C++ Software
Genetically Improved CUDA kernels for StereoCamera
GenGNN: A Generic FPGA Framework for Graph Neural Network Acceleration
GENIE: a software package for gene-gene interaction analysis in genetic association studies using multiple GPU or CPU cores
GeNN: a code generation framework for accelerated brain simulations
Genomics-GPU: A Benchmark Suite for GPU-accelerated Genome Analysis
GenVectorX: A performance-portable SYCL library for Lorentz Vectors operations
Geo-Correction of High-Resolution Imagery Using Fast Template Matching on a GPU in Emergency Mapping Contexts
Geodesic tree-based dynamic programming for fast stereo reconstruction
Geometric Algebra Computing Technology for Accelerated Processing Units
Geometric Algebra enhanced Precompiler for C++ and OpenCL
Geometric Algebra Enhanced Precompiler for C++, OpenCL and Mathematica’s OpenCLLink
Geometric Optimisation using Karva for Graphical Processing Units
Geometry Based Visualization with OpenCL
Geometry Construction from Caustic Images
Geometry Textures and Applications
Geospatial visualization using hardware accelerated real-time volume rendering
Gerbil: A Fast and Memory-Efficient k-mer Counter with GPU-Support
Getting Started with GPU Programming
GEVO-ML: Optimizing Machine Learning Code with Evolutionary Computation
GEVO: GPU Code Optimization using Evolutionary Computation
GGArray: A Dynamically Growable GPU Array
GGAS: Global GPU Address Spaces for Efficient Communication in Heterogeneous Clusters
GGNN: Graph-based GPU Nearest Neighbor Search
GHOST: Building blocks for high performance sparse linear algebra on heterogeneous systems
GHOST: GPGPU-Offloaded High Performance Storage I/O Deduplication for Primary Storage System
GHOSTM: A GPU-Accelerated Homology Search Tool for Metagenomics
GIFT: A Real-time and Scalable 3D Shape Search Engine
GigaAPI for GPU Parallelization
GiMMiK – Generating Bespoke Matrix Multiplication Kernels for Various Hardware Accelerators; Applications in High-Order Computational Fluid Dynamics
Ginkgo – A Math Library designed for Platform Portability
Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing
GIS Polygon Overlay Processing: New Parallel Algorithm and System Prototype
GiST Scan Acceleration using Coprocessors
GIST: an interactive, GPU-based level set segmentation tool for 3D medical images
GKLEE: Concolic Verification and Test Generation for GPUs
GL4D: A GPU-based Architecture for Interactive 4D Visualization
gLBM: A GPU enabled Lattice Boltzmann Method Library
Glider: A GPU Library Driver for Improved System Security
Glift: Generic, efficient, random-access GPU data structures
Global Depth from Epipolar Volumes – A General Framework for Reconstructing Non-Lambertian Surfaces
Global finite element matrix construction based on a CPU-GPU implementation
Global Illumination for Advanced Computer Graphics
Global Illumination for Interactive Lighting Design Using Light Path Pre-Computation and Hierarchical Histogram Estimation
Global memory access modelling for efficient implementation of the lattice Boltzmann method on graphics processing units
Global optimization model on power efficiency of GPU and multicore processing element for SIMD computing with CUDA
Global Point Mascon Models for Simple, Accurate and Parallel Geopotential Computation
Globally scheduled real-time multiprocessor systems with GPUs
GLoP: Enabling Massively Parallel Incident Response Through GPU Log Processing
GLOpenCL: OpenCL support on hardware- and software-managed cache multicores
Glow: Graph Lowering Compiler Techniques for Neural Networks
GLSV: Graphics library stereo vision for OpenGL
GLU3.0: Fast GPU-based Parallel Sparse LU Factorization for Circuit Simulation
GMH: A Message Passing Toolkit for GPU Clusters
GMM based Fisher vector calculation on GPGPU
GMP implementation on CUDA – A Backward Compatible Design With Performance Tuning
GMProf: A Low-Overhead, Fine-Grained Profiling Approach for GPU Programs
gNek: A GPU Accelerated Incompressible Navier Stokes Solver
Go game move prediction using convolutional neural network
Going Deeper with Embedded FPGA Platform for Convolutional Neural Network
Going green: optimizing GPUs for energy efficiency through model-steered auto-tuning
Good things come in small packages: Should we adopt Lite-GPUs in AI infrastructure?
GooFit: A library for massively parallelising maximum-likelihood fits
Google’s Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
GOST-28147 Encryption Implementation on Graphics Processing Units
GOTHIC: Gravitational oct-tree code accelerated by hierarchical time step controlling
GP on SPMD parallel graphics hardware for mega Bioinformatics data mining
Titles: 100
open PDFs: 94
packages: 32