Papers on hgpu.org (.txt-file)
Advanced 2D Rasterization on Modern CPUs
Advanced Architectures for Astrophysical Supercomputing
Advanced CFD Modeling Using GeForce GPUs
Advanced Concurrency Control Algorithm Design and GPU System Support for High Performance In-Memory Data Management
Advanced illumination techniques for GPU volume raycasting
Advanced MRI reconstruction toolbox with accelerating on GPU
Advanced Multi-Frame Rate Rendering Techniques
Advanced Optimization Techniques for Sparse Grids on Modern Heterogeneous Systems
Advanced Optimizations of An Implicit Navier-Stokes Solver on GPGPU
Advanced Programming Platform for efficient use of Data Parallel Hardware
Advanced Simulation Library: Expanding software ecosystem for the DSP/FPGA/GPU market
Advanced Techniques for the Rendering and Visualization of Volumetric Seismic Data
Advanced Trends of Heterogeneous Computing with CPU-GPU Integration: Comparative Study
Advanced ultrasound beam forming using GPGPU technology
Advanced Video Coding on CPUs and GPUs: Parallelization and RD Analysis
Advances in Electron Microscopy with Deep Learning
Advancing Large Scale Many-Body QMC Simulations on GPU Accelerated Multicore Systems
Advancing the distributed Multi-GPU ChASE library through algorithm optimization and NCCL library
Advantages and GPU implementation of high-performance indexed DNA search based on suffix arrays
Adventures in the microlensing cloud: large datasets, eResearch tools, and GPUs
ADWPNAS: Architecture-Driven Weight Prediction for Neural Architecture Search
AeminiumGPU: A CPU-GPU Hybrid Runtime for the Aeminium Language
AeminiumGPU: An Intelligent Framework for GPU Programming
Aeolian Sand Movement and Interacting with Vegetation: A GPU Based Simulation and Visualization Method
AES Algorithm Adapted on GPU Using CUDA for Small Data and Large Data Volume Encryption
AES and DES Encryption with GPU
AES Encryption Algorithm Based on the High Performance Computing of GPU
AES Encryption and Decryption Using Direct3D 10 API
AES Encryption Implementation and Analysis on Commodity Graphics Processing Units
AES Encryption Implementation on CUDA GPU and Its Analysis
AES encryption on modern consumer architectures
AES finalists implementation for GPU and multi-core CPU based on OpenCL
AES on GPU: a CUDA Implementation
Affine Vector Cache for memory bandwidth savings
AFiD-GPU: a versatile Navier-Stokes Solver for Wall-Bounded Turbulent Flows on GPU Clusters
AFOCL: Portable OpenCL Programming of FPGAs via Automated Built-in Kernel Management
Age and Gender Classification using Convolutional Neural Networks
Ageing at the Spin-Glass/Ferromagnet Transition: Monte Carlo Simulation using GPUs
Agent-based crowd simulation using GPU computing
Agent-Based Modeling on High Performance Computing Architectures
Aggregate Gaze Visualization with Real-time Heatmaps
Aging in the three-dimensional Random Field Ising Model
AI Benchmark: All About Deep Learning on Smartphones in 2019
AI Benchmark: Running Deep Neural Networks on Android Smartphones
AIPerf: Automated machine learning as an AI-HPC benchmark
Air pollution modelling using a graphics processing unit with CUDA
Airborne Downward Looking Sparse Linear Array 3-D SAR Heterogeneous Parallel Simulation
Airborne radar clutter simulation using GPU (CUDA)
Akid: A Library for Neural Network Research and Production from a Dataism Approach
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
Algebraic 3D Reconstruction of Planetary Nebulae
Algebraic Splats Representation for Point Based Models
Algorithm 9xx: Sparse QR Factorization on the GPU
Algorithm Acceleration from GPGPUs for the ATLAS Upgrade
Algorithm and implementation of multi-channel spike sorting using GPU in a home-care surveillance system
Algorithm Construction for GPGPU
Algorithm for Sparse Approximate Inverse Preconditioners in the Conjugate Gradient Method
Algorithmic and Software System Support to Accelerate Data Processing in CPU-GPU Hybrid Computing Environments
Algorithmic Contributions to the Theory of Regular Chains
Algorithmic Differentiation: Application to Variational Problems in Computer Vision
Algorithmic GPGPU Memory Optimization
Algorithmic performance studies on graphics processing units
Algorithmic Skeleton Framework for the Orchestration of GPU Computations
Algorithmic Trading: A brief, computational finance case study on data centre FPGAs
Algorithms acceleration of pattern-matching in multi-core architectures
Algorithms and Data Structures for Interactive Ray Tracing on Commodity Hardware
Algorithms and Heuristics for Scalable Betweenness Centrality Computation on Multi-GPU Systems
Algorithms for Compression on GPUs
Algorithms for Large-Scale Power Delivery Network Analysis on Massively Parallel Architectures
Algorithms for manipulating large geometric data
Algorithms for Rapid Characterization and Optimization of Aperture and Reflector Antennas
Algorithms for representation of 3D regions in radiotherapy planning software
Algorithms for Solving Non-Stationary Heat Conduction Problem for Design of a Technical Device
Algorithms for the mapping of genome sequences in GPGPU
ALICE HLT High Speed Tracking on GPU
Alignator: A GPU powered software package for robust fiducial-less alignment of cryo tilt-series
Alignment invariant image comparison implemented on the GPU
All-pairs Shortest Path Algorithm based on MPI+CUDA Distributed Parallel Programming Model
All-Pairs Shortest Path Algorithms Using CUDA
All-pairs shortest-paths for large graphs on the GPU
Alpaka – An Abstraction Library for Parallel Kernel Acceleration
Alpha-Beta Divergences Discover Micro and Macro Structures in Data
ALPINIST: An Annotation-Aware GPU Program Optimizer
ALPyNA: Acceleration of Loops in Python for Novel Architectures
Alternating Maximization: Unifying Framework for 8 Sparse PCA Formulations and Efficient Parallel Codes
Ambient Occlusion and Edge Cueing for Enhancing Real Time Molecular Visualization
Ameliorating Memory Contention of OLAP operators on GPU Processors
American Basket Option Pricing on a multi GPU Cluster
American Options Based on Malliavin Calculus and Nonparametric Variance Reduction Methods
American Options Pricing on Multi-core Graphic Cards
AMGCL – A C++ library for efficient solution of large sparse linear systems
AMGCL: an Efficient, Flexible, and Extensible Algebraic Multigrid Implementation
An 8.6 mW 25 Mvertices/s 400-MFLOPS 800-MOPS 8.91 mm Multimedia Stream Processor Core for Mobile Applications
An 80-Fold Speedup, 15.0 TFlops Full GPU Acceleration of Non-Hydrostatic Weather Model ASUCA Production Code
An abstract object oriented runtime system for heterogeneous parallel architecture
An Accelerated 3D Navier-Stokes Solver for Flows in Turbomachines
Titles: 100
open PDFs: 91
packages: 21