1173

Papers on hgpu.org (.txt-file)

Optimizing CUDA Shared Memory Usage Download

Optimizing data intensive GPGPU computations for DNA sequence alignment Download Package

Optimizing Data Locality for Iterative Matrix Solvers on CUDA Download

Optimizing Data Warehousing Applications for GPUs Using Kernel Fusion/Fission Download

Optimizing dataflow applications on heterogeneous environments Download

Optimizing Deep CNN-Based Queries over Video Streams at Scale Download Package

Optimizing Deep Learning Models For Raspberry Pi Download Package

Optimizing exact computation of Betweenness Centrality for CUDA Download

Optimizing for a Many-Core Architecture without Compromising Ease-of-Programming Download

Optimizing Full Correlation Matrix Analysis of fMRI Data on Intel Xeon Phi Coprocessors Download

Optimizing GPU to GPU Communication on Cray XK7 Download

Optimizing GPU Volume Rendering Download

Optimizing GPU-accelerated Group-By and Aggregation Download

Optimizing Hardware Resource Partitioning and Job Allocations on Modern GPUs under Power Caps Download

Optimizing High-Performance Linpack for Exascale Accelerated Architectures Download Package

Optimizing Huffman Decoding for Error-Bounded Lossy Compression on GPUs Download Package

Optimizing Krylov Subspace Solvers on Graphics Processing Units Download

Optimizing Lempel-Ziv Factorization for the GPU Architecture Download

Optimizing Linpack Benchmark on GPU-Accelerated Petascale Supercomputer Download

Optimizing LZSS Compression on GPGPUs Download

Optimizing MapReduce for GPUs with effective shared memory usage Download

Optimizing Memory Efficiency for Convolution Kernels on Kepler GPUs Download

Optimizing Memory Efficiency for Deep Convolutional Neural Networks on GPUs Download

Optimizing memory management on heterogeneous systems using polyhedral, compile-time techniques Download Package

Optimizing Memory-Bound Numerical Kernels on GPU Hardware Accelerators Download Package

Optimizing Monte Carlo radiosity on graphics hardware

Optimizing Network Performance for Distributed DNN Training on GPU Clusters: ImageNet/AlexNet Training in 1.5 Minutes Download

Optimizing OpenCL Kernels for Iterative Statistical Applications on GPUs Download

Optimizing OpenCL Local Work Group Size With Machine Learning Download

Optimizing Performance and Energy Efficiency in Massively Parallel Systems Download Package

Optimizing Performance of Recurrent Neural Networks on GPUs Download Package

Optimizing Performance of Stencil Code with SPL Conqueror Download Package

Optimizing performance per watt on GPUs in High Performance Computing: temperature, frequency and voltage effects Download Package

Optimizing RDF stores by coupling General-purpose Graphics Processing Units and Central Processing Units Download

Optimizing Real Time GPU Kernels Using Fuzzy Inference System Download

Optimizing Similarity Computations for Ontology Matching – Experiences from GOMMA Download

Optimizing simulated annealing on GPU: A case study with IC floorplanning

Optimizing Smith-Waterman algorithm on Graphics Processing Unit

Optimizing Sparse Matrix-Matrix Multiplication for the GPU Download

Optimizing Sparse Matrix-Vector Multiplication on Emerging Many-Core Architectures Download

Optimizing Stencil Computations for NVIDIA Kepler GPUs Download Package

Optimizing strassen matrix multiply on GPUs Download

Optimizing Streaming Parallelism on Heterogeneous Many-Core Architectures Download Package

Optimizing Sweep3D for Graphic Processor Unit Download

Optimizing Symmetric Dense Matrix-Vector Multiplication on GPUs Download Package

Optimizing the Computation of Eigenvalues Using Graphics Processing Units Download

Optimizing the exploitation of multicore processors and GPUs with OpenMP and OpenCL

Optimizing the Linear Fascicle Evaluation Algorithm for Multi-Core and Many-Core Systems Download

Optimizing the MapReduce Framework on Intel Xeon Phi Coprocessor Download

Optimizing the multipole-to-local operator in the fast multipole method for graphical processing units Download Package

Optimizing the Performance of Parallel and Concurrent Applications Based on Asynchronous Many-Task Runtimes Download Package

Optimizing the SUSAN corner detection algorithm for a high speed FPGA implementation

Optimizing the Weather Research and Forecasting Model with OpenMP Offload and Codee Download Package

Optimizing Urban Environmental Simulations using Boinc Download

Optimizing Web Virtual Reality Download

Optimizing Xeon Phi for Interactive Data Analysis Download

OptiML: An implicitly parallel domain-specific language for machine learning Download

Optimum Application Deployment Technology for Heterogeneous IaaS Cloud Download

Option Pricing on the GPU

Option pricing with COS method on graphics processing units

Option pricing with multi-dimensional quadrature architectures Download

OptiX: a general purpose ray tracing engine Download

Orca: FSS-based Secure Training with GPUs Download

Orchestrated Scheduling and Prefetching for GPGPUs Download

Orchestrating Multiple Data-Parallel Kernels on Multiple Devices Download

Orchestrating Thread Scheduling and Cache Management to Improve Memory System Throughput in Throughput Processors Download

Orchestration by approximation: mapping stream programs onto multicore architectures Download

Orders-of-magnitude performance increases in GPU-accelerated correlation of images from the International Space Station Download

Origami: A Convolutional Network Accelerator Download

Orion: Interference-aware, Fine-grained GPU Sharing for ML Applications Download Package

Orthogonalization on a General Purpose Graphics Processing Unit with Double Double and Quad Double Arithmetic Download

Orthogononalization on a general purpose graphics processing unit with double double and quad double arithmetic Download

Orthorectification by Using GPGPU Method Download

Out of kernel tuning and optimizations for portable large-scale docking experiments on GPUs Download

Out-of-core cone beam reconstruction using multiple GPUs Download

Out-of-core Implementation for Accelerator Kernels on Heterogeneous Clouds Download Package

Out-of-core singular value decomposition Download

Out-of-core Training for Extremely Large-Scale Neural Networks With Adaptive Window-Based Scheduling Download Package

Out-of-the-box library support for DBMS operations on GPUs Download Package

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer Download

Over-synchronization in GPU Programs Download Package

Overcoming the GPU memory limitation on FDTD through the use of overlapping subgrids

Overcomplete Dictionary Learning with Jacobi Atom Updates Download

Overdetermined Shooting Methods for Computing Standing Water Waves with Spectral Accuracy Download

Overhauling SC atomics in C11 and OpenCL Download

Overlap fermions on GPUs Download

Overlapping Computation and Communication for Advection on Hybrid Parallel Computers Download

Overlapping computation and communication of three-dimensional FDTD on a GPU cluster Download

Overtaking CPU DBMSes with a GPU in Whole-Query Analytic Processing with Parallelism-Friendly Execution Plan Optimization Download

Overview of approaches for accelerating scale invariant feature detection algorithm

Overview of implementation of DARPA GPU program in SAIC

OWL: Cooperative Thread Array Aware Scheduling Techniques for Improving GPGPU Performance Download

Owl: Differential-based Side-Channel Leakage Detection for CUDA Applications Download Package

P-HGRMS: A Parallel Hypergraph Based Root Mean Square Algorithm for Image Denoising Download

PacketShader: a GPU-accelerated software router Download

Padding Free Bank Conflict Resolution for CUDA-Based Matrix Transpose Algorithm Download

Pairwise Sequence Alignment for Very Long Sequences on GPUs Download

Pairwise Sequence Alignment with Gaps with GPU Download

PAKCK: Performance and Power Analysis of Key Computational Kernels on CPUs and GPUs Download

Panda: A Compiler Framework for Concurrent CPU-GPU Execution of 3D Stencil Computations on GPU-accelerated Supercomputers Download

 

Brief statistics for this page

Titles: 100

Download open PDFs: 90

Package packages: 23

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: