Papers on hgpu.org (.txt-file)
Compiler Optimizations for Industrial Unstructured Mesh CFD Applications on GPUs
Compiler Optimizations for SIMD/GPU/Multicore Architectures
Compiler support for general-purpose computation on GPUs
Compiler Support for High-level GPU Programming
Compiler Technologies in Deep Learning Co-Design: A Survey
Compiler-assisted distribution of OpenMP code for improved scalability
Compiler-Assisted Workload Consolidation For Efficient Dynamic Parallelism on GPU
Compiler-based Data Prefetching and Streaming Non-temporal Store Generation for the Intel Xeon Phi Coprocessor
Compiler-Based Tools to Aid in Data Transfer Optimization and On-Chip Debug of Heterogeneous Compute Systems
Compiler-centric across-stack deep learning acceleration
Compiler-directed memory management for heterogeneous MPSoCs
Compiler-Driven Performance on Heterogeneous Computing Platforms
Compiler-Level Explicit Cache for a GPGPU Programming Framework
CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research
Compilers for Portable Programming of Heterogeneous Parallel & Approximate Computing Systems
Compiling a High-level Directive-Based Programming Model for GPGPUs
Compiling a high-level language for GPUs: (via language support for architectures and compilers)
Compiling an Array Language to a Graphics Processor
Compiling and Optimizing Java 8 Programs for GPU Execution
Compiling and Optimizing OpenMP 4.X Programs to OpenCL and SPIR
Compiling for a heterogeneous vector image processor
Compiling High Performance Recursive Filters
Compiling Parallel Functional Code with Data Parallel Idealised Algol
Compiling Python to a hybrid execution environment
Compiling Stream Applications for Heterogeneous Architectures
Complete PISO and SIMPLE solvers on Graphics Processing Units
Complexity Analysis and Algorithm Design for Reorganizing Data to Minimize Non-Coalesced Memory Accesses on GPU
Complexity effective memory access scheduling for many-core accelerator architectures
Composability of parallel codes on heterogeneous architectures
Composing multiple StarPU applications over heterogeneous machines: a supervised approach
Composition and Reuse with Compiled Domain-Specific Languages
Compositional Compilation for Sparse, Irregular Data Parallelism
Compositional Deep Learning in Futhark
Compound Word Transformer: Learning to Compose Full-Song Music over Dynamic Directed Hypergraphs
Compoundly weighted Voronoi: a sequential and parallel implementation
Comprehensive Analysis of High-Performance Computing Methods for Filtered Back-Projection
Comprehensive Evaluation of OpenCL-based Convolutional Neural Network Accelerators in Xilinx and Altera FPGAs
Comprehensive Evaluations of Cone-beam CT dose in Image-guided Radiation Therapy via GPU-based Monte Carlo simulations
Comprehensive Optimization of Parametric Kernels for Graphics Processing Units
Comprehensive Performance Monitoring for GPU Cluster Systems
Compressed Dynamic Mode Decomposition for Real-Time Object Detection
Compressed Facade Displacement Maps
Compressed Learning of Deep Neural Networks for OpenCL-Capable Embedded Systems
Compressed Multiple-Row Storage Format
Compressed Real Numbers for AI: a case-study using a RISC-V CPU
Compressed sensing using hidden Markov models with application to vision based aircraft tracking
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Compressing Floating-Point Number Stream for Numerical Applications
Compression Domain Volume Rendering
Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications
Compressive Phase Contrast Tomography
Computation of Air-Vortices Based on GPU Technology: Optimizing and Parallelizing a Model for Wake-Vortex Prediction Using OpenCL
Computation of electron quantum transport in graphene nanoribbons using GPU
Computation of Galois field expressions for quaternary logic functions on GPUs
Computation of gray-level co-occurrence matrix based on CUDA and its optimization
Computation of Large Covariance Matrices by SAMMY on Graphical Processing Units and Multicore CPUs
Computation of the Isogeometric Analysis Stiffness Matrix on GPU
Computation of the Spatial Impulse Response for Ultrasonic Fields on the Graphics Processing Units (GPU)
Computation of Troposphere Slant Delays on a GPU
Computation of Voronoi diagrams using a graphics processing unit
Computation on GPU of Eigenvalues and Eigenvectors of a Large Number of Small Hermitian Matrices
Computation on programmable graphics hardware
Computational advances in gravitational microlensing: a comparison of CPU, GPU, and parallel, large data codes
Computational Biology and Applied Bioinformatics
Computational cost estimates for parallel shared memory isogeometric multi-frontal solvers
Computational Experiments in Markov Chain Monte Carlo
Computational Fluid Dynamic on GPU
Computational Fluid Dynamics Simulations using Many Graphics Processors
Computational Fluid Dynamics Using Graphics Processing Units: Challenges and Opportunities
Computational Fluid Dynamics using OpenCL – a Practical Introduction
Computational Gravitational Dynamics with Modern Numerical Accelerators
Computational investigation of intense short-wavelength laser interaction with rare gas clusters
Computational kinetics of a large scale biological process on GPU workstations: DNA bending
Computational modeling of synthetic microbial biofilms
Computational Modelling of Galaxy Formation using FLAME GPU
Computational Optimization of a Time-Domain Beamforming Algorithm Using CPU and GPU
Computational Performance Predictions for Deep Neural Network Training: A Runtime-Based Approach
Computational Physics on Graphics Processing Units
Computational Simulation of Freely Falling Water Droplets on Graphics Processing Units
Computational stereo camera system with programmable control loop
Computational wave optics library for C++: CWO++ library
Computationally Efficient Algorithms for Evaluation of Statistical Descriptors
Computationally Efficient Implementation of a Hamming Code Decoder using a Graphics Processing Unit
Computationally Efficient Tsunami Modelling on Graphics Processing Units (GPU)
Compute Distance Matrices with GPU
Compute Pairwise Manhattan Distance and Pearson Correlation Coefficient of Data Points with GPU
Compute Unified Device Architecture Application Suitability
Compute units in OpenMP: Extensions for heterogeneous parallel programming
Compute-unified device architecture implementation of a block-matching algorithm for multiple graphical processing unit cards
Computer Finit-Difference Time-Domain Simulation of Electromagnetic Wave Propagation using GPUs
Computer Finite-Difference Time-Domain Simulation of Electromagnetic Wave Propagation using GPUs
Computer generated holography using parallel commodity graphics hardware
Computer Graphics: From Pixels to Programmable Graphics Hardware
Computer Simulation of Dark Matter Effects on Galaxy Rotation
Computer Simulation of Saturn’s Ring Structure
Computer Tomography and Ultrasonography Image Registration Based on the Cooperation of GPU and CPU
Computer Vision Accelerators for Mobile Systems based on OpenCL GPGPU Co-Processing
Computer Vision and Image Segmentation Implemented on GPU Using Compute Unified Device Architecture as Applied on Quality Inspection of Pre-etched Printed Circuit Board
Computer Vision Application in Graphic Processors
Computer vision based geometric calibration in curved multi-projector displays
Titles: 100
open PDFs: 93
packages: 13