Papers on hgpu.org (.txt-file)

Optimizing All-to-All and Allgather Communications on GPGPU Clusters Download

Optimizing an OpenCL Application for Video Watermarking in FPGAs Download

Optimizing and Auto-tuning Belief Propagation on the GPU Download

Optimizing and tuning the fast multipole method for state-of-the-art multicore architectures Download

Optimizing ASP.NET with C++ AMP on the GPU Download Package

Optimizing Block-Sparse Matrix Multiplications on CUDA with TVM Download Package

Optimizing Communication by Compression for Multi-GPU Scalable Breadth-First Searches Download

Optimizing Communication for Clusters of GPUs Download

Optimizing CUDA Code By Kernel Fusion – Application on BLAS Download

Optimizing CUDA Shared Memory Usage Download

Optimizing data intensive GPGPU computations for DNA sequence alignment Download Package

Optimizing Data Locality for Iterative Matrix Solvers on CUDA Download

Optimizing Data Warehousing Applications for GPUs Using Kernel Fusion/Fission Download

Optimizing dataflow applications on heterogeneous environments Download

Optimizing Deep CNN-Based Queries over Video Streams at Scale Download Package

Optimizing Deep Learning Models For Raspberry Pi Download Package

Optimizing exact computation of Betweenness Centrality for CUDA Download

Optimizing for a Many-Core Architecture without Compromising Ease-of-Programming Download

Optimizing Full Correlation Matrix Analysis of fMRI Data on Intel Xeon Phi Coprocessors Download

Optimizing GPU to GPU Communication on Cray XK7 Download

Optimizing GPU Volume Rendering Download

Optimizing GPU-accelerated Group-By and Aggregation Download

Optimizing Hardware Resource Partitioning and Job Allocations on Modern GPUs under Power Caps Download

Optimizing High-Performance Linpack for Exascale Accelerated Architectures Download Package

Optimizing Huffman Decoding for Error-Bounded Lossy Compression on GPUs Download Package

Optimizing Krylov Subspace Solvers on Graphics Processing Units Download

Optimizing Lempel-Ziv Factorization for the GPU Architecture Download

Optimizing Linpack Benchmark on GPU-Accelerated Petascale Supercomputer Download

Optimizing LZSS Compression on GPGPUs Download

Optimizing MapReduce for GPUs with effective shared memory usage Download

Optimizing Memory Efficiency for Convolution Kernels on Kepler GPUs Download

Optimizing Memory Efficiency for Deep Convolutional Neural Networks on GPUs Download

Optimizing memory management on heterogeneous systems using polyhedral, compile-time techniques Download Package

Optimizing Memory-Bound Numerical Kernels on GPU Hardware Accelerators Download Package

Optimizing Monte Carlo radiosity on graphics hardware

Optimizing Network Performance for Distributed DNN Training on GPU Clusters: ImageNet/AlexNet Training in 1.5 Minutes Download

Optimizing OpenCL Kernels for Iterative Statistical Applications on GPUs Download

Optimizing OpenCL Local Work Group Size With Machine Learning Download

Optimizing Performance and Energy Efficiency in Massively Parallel Systems Download Package

Optimizing Performance of Recurrent Neural Networks on GPUs Download Package

Optimizing Performance of Stencil Code with SPL Conqueror Download Package

Optimizing performance per watt on GPUs in High Performance Computing: temperature, frequency and voltage effects Download Package

Optimizing RDF stores by coupling General-purpose Graphics Processing Units and Central Processing Units Download

Optimizing Real Time GPU Kernels Using Fuzzy Inference System Download

Optimizing Similarity Computations for Ontology Matching – Experiences from GOMMA Download

Optimizing simulated annealing on GPU: A case study with IC floorplanning

Optimizing Smith-Waterman algorithm on Graphics Processing Unit

Optimizing Sparse Matrix-Matrix Multiplication for the GPU Download

Optimizing Sparse Matrix-Vector Multiplication on Emerging Many-Core Architectures Download

Optimizing Stencil Computations for NVIDIA Kepler GPUs Download Package

Optimizing Strassen matrix multiply on GPUs Download

Optimizing Streaming Parallelism on Heterogeneous Many-Core Architectures Download Package

Optimizing Sweep3D for Graphic Processor Unit Download

Optimizing Symmetric Dense Matrix-Vector Multiplication on GPUs Download Package

Optimizing the Computation of Eigenvalues Using Graphics Processing Units Download

Optimizing the exploitation of multicore processors and GPUs with OpenMP and OpenCL

Optimizing the Linear Fascicle Evaluation Algorithm for Multi-Core and Many-Core Systems Download

Optimizing the MapReduce Framework on Intel Xeon Phi Coprocessor Download

Optimizing the multipole-to-local operator in the fast multipole method for graphical processing units Download Package

Optimizing the optimizer increasing performance efficiency of modern compilers Download

Optimizing the Performance of Parallel and Concurrent Applications Based on Asynchronous Many-Task Runtimes Download Package

Optimizing the SUSAN corner detection algorithm for a high speed FPGA implementation

Optimizing the Weather Research and Forecasting Model with OpenMP Offload and Codee Download Package

Optimizing Urban Environmental Simulations using Boinc Download

Optimizing Web Virtual Reality Download

Optimizing Xeon Phi for Interactive Data Analysis Download

OptiML: An implicitly parallel domain-specific language for machine learning Download

Optimum Application Deployment Technology for Heterogeneous IaaS Cloud Download

Option Pricing on the GPU

Option pricing with COS method on graphics processing units

Option pricing with multi-dimensional quadrature architectures Download

OptiX: a general purpose ray tracing engine Download

Orca: FSS-based Secure Training with GPUs Download

Orchestrated Scheduling and Prefetching for GPGPUs Download

Orchestrating Multiple Data-Parallel Kernels on Multiple Devices Download

Orchestrating Thread Scheduling and Cache Management to Improve Memory System Throughput in Throughput Processors Download

Orchestration by approximation: mapping stream programs onto multicore architectures Download

Orders-of-magnitude performance increases in GPU-accelerated correlation of images from the International Space Station Download

Origami: A Convolutional Network Accelerator Download

Orion: Interference-aware, Fine-grained GPU Sharing for ML Applications Download Package

Orthogonalization on a General Purpose Graphics Processing Unit with Double Double and Quad Double Arithmetic Download

Orthogonalization on a general purpose graphics processing unit with double double and quad double arithmetic Download

Orthorectification by Using GPGPU Method Download

Out of kernel tuning and optimizations for portable large-scale docking experiments on GPUs Download

Out-of-core cone beam reconstruction using multiple GPUs Download

Out-of-core Implementation for Accelerator Kernels on Heterogeneous Clouds Download Package

Out-of-core singular value decomposition Download

Out-of-core Training for Extremely Large-Scale Neural Networks With Adaptive Window-Based Scheduling Download Package

Out-of-the-box library support for DBMS operations on GPUs Download Package

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer Download

Over-synchronization in GPU Programs Download Package

Overcoming the GPU memory limitation on FDTD through the use of overlapping subgrids

Overcomplete Dictionary Learning with Jacobi Atom Updates Download

Overdetermined Shooting Methods for Computing Standing Water Waves with Spectral Accuracy Download

Overhauling SC atomics in C11 and OpenCL Download

Overlap fermions on GPUs Download

Overlapping Computation and Communication for Advection on Hybrid Parallel Computers Download

Overlapping computation and communication of three-dimensional FDTD on a GPU cluster Download

Overtaking CPU DBMSes with a GPU in Whole-Query Analytic Processing with Parallelism-Friendly Execution Plan Optimization Download

Overview of approaches for accelerating scale invariant feature detection algorithm

 

Brief statistics for this page

Titles: 100

Download open PDFs: 91

Package packages: 24

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org