Papers on hgpu.org (.txt-file)
Research on Three-Dimensional Playing Video Technology in Virtual Education Environment
Reservoir Simulation on NVIDIA Tesla GPUs
Resolution of Linear Algebra for the Discrete Logarithm Problem using GPU and Multi-core Architectures
Resolution of the Vlasov-Maxwell system by PIC Discontinuous Galerkin method on GPU with OpenCL
Resolving the conflict between generality and plausibility in verified computation
Resource Centered Computing delivering high parallel performance
Resource Elastic Virtualization for FPGAs using OpenCL
Resource Sharing in GPU-Accelerated Windowing Systems
Resource-Aware Compiler Prefetching for Fine-Grained Many-Cores
Resource-Aware Just-in-Time OpenCL Compiler for Coarse-Grained FPGA Overlays
ReSYCLator: Transforming CUDA C++ source code into SYCL
Retargeting and Respecializing GPU Workloads for Performance Portability
Rethinking resampling in the particle filter on graphics processing units
Rethinking Runtime Verification on Hundreds of Cores: Challenges and Opportunities
Rethinking the Union of Computed Tomography Reconstruction and GPGPU Computing
Returning control to the programmer: SIMD intrinsics for virtual machines
RETURNN: The RWTH Extensible Training framework for Universal Recurrent Neural Networks
Reusable OpenCL FPGA Infrastructure
Reusable software components for accelerator-based clusters
Reuse and Refactoring of GPU Kernels to Design Complex Applications
Reusing Auto-Schedules for Efficient DNN Compilation
Reveal training performance mystery between TensorFlow and PyTorch in the single GPU environment
Reverberant speech recognition combining deep neural networks and deep autoencoders augmented with a phone-class feature
Reverse Computation for Rollback-based Fault Tolerance in Large Parallel Systems: Evaluating the Potential Gains and Systems Effects
Reverse-Mode AD of Reduce-by-Index and Scan in Futhark
Review and Comparative Study of Ray Traversal Algorithms on a Modern GPU Architecture
Review of Memory/Cache Management Technologies used on Heterogeneous Computing Systems
Review: Kd-tree Traversal Algorithms for Ray Tracing
Reviewing GPU architectures to build efficient back projection for parallel geometries
Revision of Relational Joins for Multi-Core and Many-Core Architectures
Revisit Long Short-Term Memory: An Optimization Perspective
Revisiting Actor Programming in C++
Revisiting Co-Processing for Hash Joins on the Coupled CPU-GPU Architecture
Revisiting Edge and Node Parallelism for Dynamic GPU Graph Analytics
Revisiting Online Autotuning for Sparse-Matrix Vector Multiplication Kernels on High-Performance Accelerators
Revisiting Online Autotuning for Sparse-Matrix Vector Multiplication Kernels on Next-Generation Architectures
Revisiting Query Performance in GPU Database Systems
Revisiting sorting for GPGPU stream architectures
Revisiting the Case of ARM SoCs in High-Performance Computing Clusters
Revolutionary technologies for acceleration of emerging petascale applications
RGEM: A Responsive GPGPU Execution Model for Runtime Engines
Rgtsvm: Support Vector Machines on a GPU in R
Ringing: Frugal Subdivision of Curves and Surfaces
Rinnegan: Efficient Resource Use in Heterogeneous Architectures
Ripple: Simplified Large-Scale Computation on Heterogeneous Architectures with Polymorphic Data Layout
Rise of the Graphics Processor
Risk Estimation Without Using Stein’s Lemma — Application to Image Denoising
Ristretto: Hardware-Oriented Approximation of Convolutional Neural Networks
RNA secondary structure prediction using dynamic programming algorithm – A review and proposed work
RNS-Based Elliptic Curve Point Multiplication for Massive Parallel Architectures
RoadRunner: a fast and flexible exoplanet transit model
Roberts edge detection algorithm based on GPU
Robotic approach to multi-beam optical tweezers with Computer Generated Hologram
Robust Adaptive 3-D Segmentation of Vessel Laminae From Fluorescence Confocal Microscope Images and Parallel GPU Implementation
Robust Computational Tools for Multiple Testing With Genetic Association Studies
Robust Edge Detection and GPU-Based Smoothing for Extracting Surface Primitives from Range Images
Robust foreground segmentation for GPU architecture in an immersive 3D videoconferencing system
Robust GPGPU plugin development for RapidMiner
Robust GPU-assisted camera tracking using free-form surface models
Robust Low Complexity Feature Tracking using CUDA
Robust mesh reconstruction from unoriented noisy points
Robust modified L2 local optical flow estimation and feature tracking
Robust non-local denoising of colored depth data
Robust real time face recognition and tracking on gpu using fusion of rgb and depth image
Robust Real-Time Multiprocessor Interrupt Handling Motivated by GPUs
Rodinia: A benchmark suite for heterogeneous computing
Romou: Rapidly Generate High-Performance Tensor Kernels for Mobile GPUs
Room acoustics modelling using GPU-accelerated finite difference and finite volume methods on a face-centered cubic grid
Rootbeer: Seamlessly using GPUs from Java
Rotationally invariant sparse patch matching on GPU and FPGA
Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 1. Generalized Born
RSVDPACK: Subroutines for computing partial singular value decompositions via randomized sampling on single core, multi core, and GPU architectures
RTCUDB: Building Databases with RT Processors
RTIndeX: Exploiting Hardware-Accelerated GPU Raytracing for Database Indexing
RTSL: a Ray Tracing Shading Language
RTX Beyond Ray Tracing: Exploring the Use of Hardware Ray Tracing Cores for Tet-Mesh Point Location
RubiCL, a Library Providing Automatic Parallelisation on CPU and GPU devices
Rubus: A compiler for seamless and extensible parallelism
RUMD: A general purpose molecular dynamics package optimized to utilize GPU hardware down to a few thousand particles
Run-time Image and Video Resizing Using CUDA-enabled GPUs
Run-time Reconfigurable Multiprocessors
Run-time support for multi-level disjoint memory address spaces
Run, Stencil, Run! – A Comparison of Modern Parallel Programming Paradigms
Running Financial Risk Management Applications on FPGA in the Amazon Cloud
Running the NIM Next-Generation Weather Model on GPUs
Running unstructured grid-based CFD solvers on modern graphics hardware
Running unstructured grid-based CFD solvers on modern graphics hardware
Runtime Code Generation and Data Management for Heterogeneous Computing in Java
Runtime Comparison of CPU and GPU Using Portable Programming Models
Runtime Compilation of Array-Oriented Python Programs
Runtime Configurable Deep Neural Networks for Energy-Accuracy Trade-off
Runtime Performances Benchmark for Knowledge Graph Embedding Methods
Runtime Specialization for Heterogeneous CPU-GPU Platforms
Runtime Support for Adaptive Power Capping on Heterogeneous SoCs
Runtime Support for Performance Portability on Heterogeneous Distributed Platforms
Runtime Support toward Transparent Memory Access in GPU-accelerated Heterogeneous Systems
Runtime Systems and Scheduling Support for High-End CPU-GPU Architectures
Runtime Visualization of Application Progress and Monitoring of a GPU-enabled Parallel Environment
S-buffer: Sparsity-aware Multi-fragment Rendering
SABER: Window-Based Hybrid Stream Processing for Heterogeneous Architectures
Titles: 100
Doubles=1
open PDFs: 94
packages: 18