
Papers on hgpu.org (.txt-file)

Profiling based Out-of-core Hybrid Method for Large Neural Networks Download

Profiling Concurrent Vision Inference Workloads on NVIDIA Jetson – Extended Download

Profiling General Purpose GPU Applications Download

Profiling Heterogeneous Multi-GPU Systems to Accelerate Cortically Inspired Learning Algorithms Download

Profiling High Level Heterogeneous Programs: Using the SPOC GPGPU framework for OCaml Download Package

Profiling of Data-Parallel Processors Download

Program Acceleration in a Heterogeneous Computing Environment Using OpenCL, FPGA, and CPU Download

Program Analysis and Machine Learning based Approach to Predict Power Consumption of CUDA Kernel Download Package

Program optimization carving for GPU computing

Program Optimization of Array-Intensive SPEC2k Benchmarks on Multithreaded GPU Using CUDA and Brook+

Program Optimization of Stencil Based Application on the GPU-Accelerated System

Program optimization space pruning for a multithreaded gpu Download

Program Optimization Strategies for Data-Parallel Many-Core Processors Download

Program Optimization Study on a 128-Core GPU Download

PROGRAML: A Graph-based Program Representation for Data Flow Analysis and Compiler Optimizations Download Package

ProGraML: Graph-based Deep Learning for Program Optimization and Analysis Download Package

Programmability and Performance Portability Aspects of Heterogeneous Multi-/Manycore Systems Download

Programmability: Design Costs and Payoffs using AMD GPU Streaming Languages and Traditional Multi-Core Libraries Download

Programmable and Scalable Architecture for Graphics Processing Units Download

Programmable shaders for deformation rendering Download

Programming Abstractions and Optimization Techniques for GPU-based Heterogeneous Systems Download Package

Programming and Performance of Graphics Processors in Shock Waves Simulation by Finite Volume Method Download

Programming and Scheduling Model for Supporting Heterogeneous Accelerators in Linux Download Package

Programming Challenges for the Implementation of Numerical Quadrature in Atomic Physics on FPGA and GPU Accelerators

Programming CUDA and OpenCL: A Case Study Using Modern C++ Libraries Download Package

Programming Dense Linear Algebra Kernels on Vectorized Architectures Download

Programming Embedded Manycore: Refinement and Optimizing Compilation of a Parallel Action Language for Hierarchical State Machines Download

Programming finite-difference time-domain for graphics processor units using compute unified device architecture Download

Programming for scientific computing on peta-scale heterogeneous parallel systems Download

Programming framework for clusters with heterogeneous accelerators Download

Programming Frameworks for Distributed Smartphone Computing Download

Programming Future Parallel Architectures with Haskell and Intel ArBB Download

Programming GPUs with C++14 and Just-In-Time Compilation Download

Programming Heterogeneous Systems from an Image Processing DSL Download Package

Programming Heterogeneous Systems with General and Domain-Specific Frameworks Download Package

Programming hybrid systems with implicit memory based synchronization Download Package

Programming in CUDA for Kepler and Maxwell Architecture Download

Programming issues for video analysis on Graphics Processing Units Download

Programming Many-Core Chips Download

Programming Massively Parallel Architectures using MARTE: a Case Study Download

Programming massively parallel processors: A Hands-on approach

Programming Massively Parallel Processors with CUDA (audio course) Download Package

Programming model for a heterogeneous x86 platform

Programming Models and Runtimes for Heterogeneous Systems Download

Programming Models and Scheduling Techniques for Heterogeneous Architectures Download

Programming Models and Tools for Many-Core Platforms Download

Programming NVIDIA cards by means of transitive closure based parallelization algorithms Download

Programming of shared memory GPUs shared memory systems Download

Programming on Parallel Machines: GPU, Multicore, Clusters and More Download Package

Programming video cards for computational electromagnetics applications

Programming with Explicit Dependencies. A Framework for Portable Parallel Programming Download

Programming-Model Centric Debugging for Multicore Embedded Systems Download

Progressive Clustering of Big Data with GPU Acceleration and Visualization Download

Progressive High-Quality Response Surfaces for Visually Guided Sensitivity Analysis Download

Progressive Photon Mapping on GPUs Download Package

Progressive Semantic Segmentation Download

Projected tetrahedra revisited: a barycentric formulation applied to digital radiograph reconstruction using higher-order attenuation functions Download

Projectile Monte-Carlo Trajectory Analysis Using a Graphics Processing Unit Download

Projecting Tetrahedra with a Simplified Basis Graph

PROJECTION Algorithm for Motif Finding on GPUs Download

Promise of embedded system with GPU in artificial leg control: Enabling time-frequency feature extraction from electromyography Download

Proposition for propagated occupation grids for non-rigid moving objects tracking Download

Prospects for scalable 3D FFTs on heterogeneous exascale systems Download Package

Prospects of GPGPU in the Auger Offline Software Framework Download

pROST: A Smoothed Lp-norm Robust Online Subspace Tracking Method for Realtime Background Subtraction in Video Download

PROST: Parallel robust online simple tracking Download

Protecting Real-Time GPU Applications on Integrated CPU-GPU SoC Platforms Download

Protein alignment algorithms with an efficient backtracking routine on multiple GPUs Download Package

Proteus: Efficient Resource Use in Heterogeneous Architectures Download

Proteus: Exploiting Numerical Precision Variability in Deep Neural Networks Download

Prototyping flexible touch screen devices using collocated haptic-graphic elastic-object deformation on the GPU

Prototyping methodology of image processing applications on heterogeneous parallel systems Download

ProtoX: A First Look Download

Provably Efficient GPU Algorithms Download

Providing performance portable numerics for Intel GPUs Download Package

Providing Source Code Level Portability Between CPU and GPU with MapCG Download

PSCToolkit: solving sparse linear systems with a large number of GPUs Download Package

Pseudo Random Number Generators on Graphics Processing Units, with Applications in Finance Download

Pseudo-random number generation for Brownian Dynamics and Dissipative Particle Dynamics simulations on GPU devices Package

Pseudo-Random Number Generation on GP-GPU

Pseudo-random number generators for Monte Carlo simulations on ATI Graphics Processing Units

Pseudo-random number generators for Monte Carlo simulations on Graphics Processing Units Download

Pseudorandom number generation on the GPU Download

Pseudorandom Numbers Generation for Monte Carlo Simulations on GPUs: OpenCL Approach Download Package

Pseudoscalar Meson in Two Flavors QCD with the Optimal Domain-Wall Fermion Download

pSTL-Bench: A Micro-Benchmark Suite for Assessing Scalability of C++ Parallel STL Implementations Download

PTask: Operating System Abstractions To Manage GPUs as Compute Devices Download

PTX2Kernel: Converting PTX Code into Compilable Kernels Download

PUGACE, a cellular Evolutionary Algorithm framework on GPUs Download

Pulsar Acceleration Searches on the GPU for the Square Kilometre Array Download

Pulsar search acceleration using FPGAs and OpenCL templates Download Package

Pulse-coupled neural network performance for real-time identification of vegetation during forced landing Download

Purine: A bi-graph based deep learning framework Download

Pushing the Envelope: Extreme Network Coding on the GPU Download

Pushing the limit of molecular dynamics with ab initio accuracy to 100 million atoms with machine learning Download

Pushing the limits for medical image reconstruction on recent standard multicore processors Download

Putting Automatic Polyhedral Compilation for GPGPU to Work Download Package

pVOCL: Power-Aware Dynamic Placement and Migration in Virtualized GPU Environments Download

PVR: Patch-to-Volume Reconstruction for Large Area Motion Correction of Fetal MRI Download Package

pyATF: Constraint-Based Auto-Tuning in Python Download Package

 

Brief statistics for this page

Titles: 100

Open PDFs available for download: 88

Packages: 23

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors
