Papers on hgpu.org (.txt-file)
Graph Analysis with High-Performance Computing
Graph Coarsening and Clustering on the GPU
Graph Generation on GPUs using Dynamic Memory Allocation
Graph grammar based multi-frontal direct solver for isogeometric FEM simulations on GPU
Graph Processing on GPUs: A Survey
Graph-based Parallel Analysis of Large Analog Circuits Based on GPU Platforms
Graph-Based Substructure Pattern Mining Using CUDA Dynamic Parallelism
GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU
Graphic Processing Unit Simulation of Axon Growth and Guidance through Cue Diffusion on Massively Parallel Processors
Graphic processing unit-accelerated mutual information-based 3D image rigid registration
Graphic processors to speed-up simulations for the design of high performance solar receptors
Graphic-Card Cluster for Astrophysics (GraCCA) – Performance Tests
Graphic-Processing-Units Based Adaptive Parameter Estimation of a Visual Psychophysical Model
Graphical processing unit implementation of an integrated shape-based active contour: Application to digital pathology
Graphical Processing Units (GPU) acceleration of finite-difference frequency-domain (FDFD) technique
Graphical Processing Units (GPU)-based modeling for Acoustic and Ultrasonic NDE
Graphical Processing Units for Quantum Chemistry
Graphics Card as a Cheap Supercomputer
Graphics hardware & GPU computing: past, present, and future
Graphics Hardware based Efficient and Scalable Fuzzy C-Means Clustering
Graphics Hardware Implementation of the Parameter-Less Self-organising Map
Graphics Hardware-Based Level-Set Method for Interactive Segmentation and Visualization
Graphics Processing Unit (GPU) Implementation Methodology of AERMOD Model
Graphics processing unit (GPU) programming strategies and trends in GPU computing
Graphics processing unit accelerated non-uniform fast Fourier transform for ultrahigh-speed, real-time Fourier-domain OCT
Graphics Processing Unit Accelerated O(N) Micromagnetic Solver
Graphics Processing Unit Acceleration of the Explicit Solution of the Time Domain Volume Integral Equation Using OpenACC
Graphics Processing Unit acceleration of the Random Phase Approximation in the projector augmented wave method
Graphics Processing Unit Audio Signals Processing in Pure Data and PdCUDA an Implementation with the CUDA Runtime API
Graphics Processing Unit based searching the critical slip surface of slopes by the Vector Sum Analysis Method
Graphics Processing Unit Bloom Filters: Classical and Probabilistic
Graphics processing unit implementation of lattice Boltzmann models for flowing soft systems
Graphics processing unit implementations of relative expression analysis algorithms enable dramatic computational speedup
Graphics Processing Unit Utilization in Circuit Simulation
Graphics processing unit–accelerated holography by simulated annealing
Graphics Processing Unit-Accelerated Quantitative Trait Loci Detection
Graphics Processing Unit-Based Computer-Aided Design Algorithms for Electronic Design Automation
Graphics Processing Units and Genetic Programming: An overview
Graphics Processing Units and High-Dimensional Optimization
Graphics Processing Units for Handhelds
Graphics Processing Units for the Real-time Linear Elastostatic Simulation of Liver
Graphics Processing Units in Acceleration of Bandwidth Selection for Kernel Density Estimation
Graphics Processing Units: More Than the Pathway to Realistic Video-Games
Graphics Processor Clusters for High Speed Backpropagation
Graphics Processor Unit (GPU) Acceleration of Finite-Difference Frequency-Domain (FDFD) Method
Graphics processor unit (GPU) acceleration of finite-difference time-domain (FDTD) algorithm
Graphics Programming on the Web WebCL Course Notes
Graphics Supercomputing Applied to Brain Image Analysis with NiftyReg
Graphtoy: Fast Software Simulation of Applications for AMD’s AI Engines
GraphVite: A High-Performance CPU-GPU Hybrid System for Node Embedding
GRATER: An Approximation Workflow for Exploiting Data-Level Parallelism in FPGA Acceleration
GraviDy: a GPU modular, parallel N-body integrator
Gravitational tree-code on graphics processing units: implementation in CUDA
Gravitational wave astrophysics, data analysis and multimessenger astronomy
GrAVity: a massively parallel antivirus engine
GRay: a Massively Parallel GPU-Based Code for Ray Tracing in Relativistic Spacetimes
Green AI: A Preliminary Empirical Study on Energy Consumption in DL Models Across Different Runtime Infrastructures
GreenGPU: A Holistic Approach to Energy Efficiency in GPU-CPU Heterogeneous Architectures
Grex: An efficient MapReduce framework for graphics processing units
Grid-based SAH BVH construction on a GPU
Grids, Clouds and Virtualization
grim: A Flexible, Conservative Scheme for Relativistic Fluid Theories
GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices based on Fine-Grained Structured Weight Sparsity
GrIP: A Framework for Experiments with Screen Space Algorithms
GRN: Gated Relation Network to Enhance Convolutional Neural Network for Named Entity Recognition
GROMACS on AMD GPU-Based HPC Platforms: Using SYCL for Performance and Portability
GROMACS on Hybrid CPU-GPU and CPU-MIC Clusters: Preliminary Porting Experiences, Results and Next Steps
GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers
GROPHECY: GPU performance projection from CPU code skeletons
Group Marching Tree: Sampling-Based Approximately Optimal Motion Planning on GPUs
Grover: Looking for Performance Improvement by Disabling Local Memory Usage in OpenCL Kernels
GRS – GPU radix sort for multifield records
gScan: Accelerating Graham Scan on the GPU
gSLIC: a real-time implementation of SLIC superpixel segmentation
gSLICr: SLIC superpixels at over 250Hz
gSMat: A Scalable Sparse Matrix-based Join for SPARQL Query Processing
GSNP: A DNA Single-Nucleotide Polymorphism Detection System with GPU Acceleration
GStream: A General-Purpose Data Streaming Framework on GPU Clusters
gSuite: A Flexible and Framework Independent Benchmark Suite for Graph Neural Network Inference on GPUs
GT4Py: High Performance Stencils for Weather and Climate Applications using Python
GUESS-ing Polygenic Associations with Multiple Phenotypes Using a GPU-Based Evolutionary Stochastic Search Algorithm
Guided Profiling for Auto-Tuning Array Layouts on GPUs
Gunrock: A High-Performance Graph Processing Library on the GPU
Gvim: Gpu-accelerated virtual machines
Gyrofluid Modeling of Turbulent, Kinetic Physics
Gyrokinetic Particle-in-Cell Optimization on Emerging Multi- and Manycore Platforms
Gyrokinetic Toroidal Simulations on Leading Multi-and Manycore HPC Systems
gZCCL: Compression-Accelerated Collective Communication Framework for GPU Clusters
H- and C-level WFST-based large vocabulary continuous speech recognition on Graphics Processing Units
H-LU Factorization on Many-Core Systems
H. 264 Parallel Optimization on Graphics Processors
H.264/AVC motion estimation implementation on Compute Unified Device Architecture (CUDA)
HACC: Simulating Sky Surveys on State-of-the-Art Supercomputing Architectures
HAccRG: Hardware-Accelerated Data Race Detection in GPUs
Hacking Neural Networks: A Short Introduction
Titles: 100
open PDFs: 89
packages: 23