Papers on hgpu.org (.txt-file)
High-Performance General Solver for Extremely Large-Scale Semidefinite Programming Problems
High-Performance GPGPU Programming with OCaml
High-performance GPU based Rendering for Real-Time, rigid 2D/3D-Image Registration in Radiation Oncology
High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs
High-Performance High-Order Stencil Computation on FPGAs Using OpenCL
High-Performance Holistic XML Twig Filtering Using GPUs
High-Performance Image Synthesis for Radio Interferometry
High-performance Implementations and Large-scale Validation of the Link-wise Artificial Compressibility Method
High-Performance Interactive Scientific Visualization With Datoviz via the Vulkan Low-Level GPU API
High-Performance Iterative Electron Tomography Reconstruction with Long-Object Compensation using Graphics Processing Units (GPUs)
High-Performance Location-Aware Publish-Subscribe on GPUs
High-Performance Matrix-Vector Multiplication on the GPU
High-Performance Monte Carlo Radiosity on GPU based on Scene Partitioning
High-Performance Multi-View Reconstruction
High-Performance Neural Networks for Visual Object Classification
High-Performance Online Spatial and Temporal Aggregations on Multi-core CPUs and Many-Core GPUs
High-Performance Out-of-core Block Randomized Singular Value Decomposition on GPU
High-Performance Physics Simulations Using Multi-Core CPUs and GPGPUs in a Volunteer Computing Context
High-performance polynomial GCD computations on graphics processors
High-Performance Pseudo-Random Number Generation on Graphics Processing Units
High-Performance Reverse Time Migration on GPU
High-performance SIMT code generation in an active visual effects library
High-performance software rasterization on GPUs
High-performance sparse matrix-matrix products on Intel KNL and multicore architectures
High-performance sparse matrix-vector multiplication on GPUs for structured grid computations
High-Performance Spatial Join Processing on GPGPUs with Applications to Large-Scale Taxi Trip Data
High-Performance Spatial Query Processing on Big Taxi Trip Data using GPGPUs
High-Performance Symmetric Block Ciphers on Multicore CPU and GPUs
High-Performance Tensor Contractions for GPUs
High-Performance Zonal Histogramming on Large-Scale Geospatial Rasters Using GPUs and GPU-Accelerated Clusters
High-precision molecular dynamics simulation of UO2-PuO2: Anion self-diffusion in UO2
High-precision molecular dynamics simulation of UO2-PuO2: pair potentials comparison
High-precision molecular dynamics simulation of UO2-PuO2: superionic transition in uranium dioxide
High-precision Monte Carlo study of the three-dimensional XY model on GPU
High-Precision Numerical Simulations of Rotating Black Holes Accelerated by CUDA
High-Quality Point-Based Rendering on Modern GPUs
High-quality pre-integrated volume rendering using hardware-accelerated pixel shading
High-quality Real-time Stereo using Adaptive Cost Aggregation and Dynamic Programming
High-Quality Rendering of Quartic Spline Surfaces on the GPU
High-Quality Rendering of Varying Isosurfaces with Cubic Trivariate C1-Continuous Splines
High-quality surface splatting on today’s GPUs
High-Quality, Semi-Analytical Volume Rendering for AMR Data
High-Speed Dense Stereo Via Directional Center-Biased Windows on Graphics Hardware
High-speed electromagnetic field simulation by HIE-FDTD method with GPGPU
High-Speed GPU-Based Fully Three-Dimensional Diffuse Optical Tomographic System
High-Speed Implementations of Block Cipher ARIA Using Graphics Processing Units
High-Speed Object Detection: Design, Study and Implementation of a Detection Framework using Channel Features and Boosting
High-Speed Private Information Retrieval Computation on GPU
High-Speed Stream-Centric Dense Stereo and View Synthesis on Graphics Hardware
High-Speed Turbo Equalization for GPP-based Software Defined Radios
High-speed volume ray casting with CUDA
High-Throughput All-Atom Molecular Dynamics Simulations Using Distributed Computing
High-throughput Analysis of Large Microscopy Image Datasets on CPU-GPU Cluster Platforms
High-throughput bayesian computing machine with reconfigurable hardware
High-throughput Bayesian network learning using heterogeneous multicore computers
High-throughput Execution of Hierarchical Analysis Pipelines on Hybrid Cluster Platforms
High-Throughput parallel blind Virtual Screening using BINDSURF
High-Throughput Parallel Viterbi Decoder on GPU Tensor Cores
High-throughput protein crystallization on the World Community Grid and the GPU
High-throughput sequence alignment using Graphics Processing Units
High-Throughput Sequence Translation Using CUDA
High-throughput stream categorization and intrusion detection on GPU
High-Throughput Transaction Executions on Graphics Processors
Higher order FEM numerical integration on GPUs with OpenCL
Higher-order CFD and Interface Tracking Methods on Highly-Parallel MPI and GPU systems
Highly accelerated feature detection in proteomics data sets using modern graphics processing units
Highly accelerated simulations of glassy dynamics using GPUs: caveats on limited floating-point precision
Highly Efficient 8-bit Low Precision Inference of Convolutional Neural Networks with IntelCaffe
Highly Efficient Forward and Backward Propagation of Convolutional Neural Networks for Pixelwise Classification
Highly Efficient Lattice-Boltzmann Multiphase Simulations of Immiscible Fluids at High-Density Ratios on CPUs and GPUs through Code Generation
Highly efficient mapping of the Smith-Waterman algorithm on CUDA-compatible GPUs
Highly interactive computational steering for coupled 3D flow problems utilizing multiple GPUs
Highly Optimized Full GPU-Acceleration of Non-hydrostatic Weather Model SCALE-LES
Highly optimized simulations on single- and multi-GPU systems of 3D Ising spin glass
Highly parallel decoding of space-time codes on graphics processing units
Highly Parallel Rate-Distortion Optimized Intra-Mode Decision on Multicore Graphics Processors
Highly Scalable Multi Objective Test Suite Minimisation Using Graphics Cards
Highly Scalable Multiplication for Distributed Sparse Multivariate Polynomials on Many-core Systems
Hinomiyagura Infrastructure Competiton TDP: Platform of rescue simulation using GPGPU
HIPAcc: A Domain-Specific Language and Compiler for Image Processing
HipBone: A performance-portable GPU-accelerated C++ version of the NekBone benchmark
HIPRT: A Ray Tracing Framework in HIP
HiRace: Accurate and Fast Source-Level Race Checking of GPU Programs
HISQ inverter on Intel Xeon Phi and NVIDIA GPUs
Histogram Computations on GPUs Kernel using Global and Shared Memory Atomics
Historic Learning Approach for Auto-tuning OpenACC Accelerated Scientific Applications
Historygrams: Enabling Interactive Global Illumination in Direct Volume Rendering using Photon Mapping
HLS Portability from Intel to Xilinx: A Case Study
hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices
HLSDataset: Open-Source Dataset for ML-Assisted FPGA Design using High Level Synthesis
hlslib: Software Engineering for Hardware Design
HOCL: A Family of Embedded Languages
Homomorphic-Encrypted Volume Rendering
Homunculus Warping: Conveying importance using self-intersection-free non-homogeneous mesh deformation
HONEI: A collection of libraries for numerical computations targeting multiple processor architectures
HORIZON: Accelerated General Relativistic Magnetohydrodynamics
Titles: 100
open PDFs: 86
packages: 19