1173

Papers on hgpu.org (.txt-file)

Performance Traps in OpenCL for CPUs Download

Performance Tuning for CUDA-Accelerated Neighborhood Denoising Filters Download

Performance Tuning for GPU-Embedded Systems: Machine-Learning-based and Analytical Model-driven Tuning Methodologies Download

Performance Upper Bound Analysis and Optimization of SGEMM on Fermi and Kepler GPUs Download

Performance-Analysis-Based Acceleration of Image Quality Assessment Download

Performance-aware component composition for GPU-based systems Download

Performance-Correctness Challenges in Emerging Heterogeneous Multicore Processors Download

Performance-efficient mechanisms for managing irregularity in throughput processors Download

Performance-Oriented Neural Architecture Search Download

Performance-Portable Many-Core Plasma Simulations: Porting PIConGPU to OpenPower and Beyond Download Package

Performance/power assessment of CNN packages on embedded automotive platforms Download

Performant low-order matrix-free finite element kernels on GPU architectures Download Package

Performing DCT8x8 Computation on GPU Using NVIDIA CUDA Technology Download

Performing efficient NURBS modeling operations on the GPU Download

Performing with CUDA Download

PeriPy – A High Performance OpenCL Peridynamics Package Download Package

permGPU: Using graphics processing units in RNA microarray association studies Download Package

Permutation Index and GPU to Solve efficiently Many Queries Download

Persistent Kernels for Iterative Memory-bound GPU Applications Download Package

Persistent RNNs: Stashing Recurrent Weights On-Chip Download Package

Perturbation Functions in Computer Graphics Download

Petaflop biofluidics simulations on a two million-core system Download

Petascale Application of a Coupled CPU-GPU Algorithm for Simulation and Analysis of Multiphase Flow Solutions in Porous Medium Systems Download

Petascale computations for Large-scale Atomic and Molecular collisions Download

Petascale Direct Numerical Simulation of Blood Flow on 200K Cores and Heterogeneous Architectures Download

Petascale elliptic solvers for anisotropic PDEs on GPU clusters Download

Petascale turbulence simulation using a highly parallel fast multipole method Download

Petascale visualization: Approaches and initial results Download

PFAC Library: GPU-based string matching algorithm Download Package

PFunc: modern task parallelism for modern high performance computing Package

PG-PuReMD: A Parallel-GPU Reactive Molecular Dynamics Package Download

PGEM: Preemptive GPGPU Execution Model for Runtime Engines Download Package

Pgx: Hardware-accelerated parallel game simulation for reinforcement learning Download Package

Phase Aware Memory Scheduling Download

Phase Based Volume Registration on the GPU with Application to Quantitative MRI Download

Phase Based Volume Registration Using CUDA Download

Phase diagram and critical behavior of the square-lattice Ising model with competing nearest- and next-nearest-neighbor interactions Download

Phase Transition in 3d Heisenberg Spin Glasses with Strong Random Anisotropies, through a Multi-GPU Parallelization Download

phiGEMM: a CPU-GPU library for porting Quantum ESPRESSO on hybrid systems Download Package

Phoenix: A Runtime Environment for High Performance Computing on Chip Multiprocessors Download

Photon mapping on programmable graphics hardware Download

Physical and graphical effects in OpenCL by example

Physical modeling and high-performance GPU computing for characterization, interception, and disruption of hazardous near-Earth objects Download

Physically Based Rendering: Implementation of Path Tracer Download

Physically-Based Interactive Flow Visualization Based on Schlieren and Interferometry Experimental Techniques Download

Physically-based interactive schlieren flow visualization Download

Physically-based painting style 3D image synthesis using GPU

Physically-Based Sound Synthesis on GPUs Download

Physically-based visual simulation on graphics hardware Download

Physics and Computing Performance of the Exa.TrkX TrackML Pipeline Download Package

Physis: An Implicitly Parallel Programming Model for Stencil Computations on Large-Scale GPU-Accelerated Supercomputers Download Package

Piccolo: building fast, distributed programs with partitioned tables Download

PIConGPU: A Fully Relativistic Particle-in-Cell Code for a GPU Cluster Download

PIConGPU: Predictive Simulations of Laser-Particle Accelerators with Manycore Hardware Download Package

Piecewise Tri-linear Contouring for Multi-material Volumes Download

PIGEON: Optimizing CUDA Code Generator for End-to-End Training and Inference of Relational Graph Neural Networks Download

Piko: A Design Framework for Programmable Graphics Pipelines Download

PILC: Practical Image Lossless Compression with an End-to-end GPU Oriented Neural Framework Download

PipeCNN: An OpenCL-Based FPGA Accelerator for Large-Scale Convolution Neuron Networks Download Package

Pipeline strategies to accelerate range query processing on a multi-GPU environment Download

Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units Download Package

Pipelined MapReduce: A Decoupled MapReduce RunTime for Shared Memory Multi-Processors Download

Pipelined Training with Stale Weights of Deep Convolutional Neural Networks Download

Pipelining the Fast Multipole Method over a Runtime System Download

PIPS Is not (just) Polyhedral Software Download

PIR: PMaC’s Idiom Recognizer Download

PISTON: A Portable Cross-Platform Framework for Data-Parallel Visualization Operators Download Package

Pixel-Exact Rendering of Spacetime Finite Element Solutions Download

PixelPie: Maximal Poisson-disk Sampling with Rasterization Download

Places205-VGGNet Models for Scene Recognition Download Package

Planetary-Scale Terrain Composition

Plant Leaf Modeling and Rendering Based-On GPU

Plasma Visualization in Parallel using Particle Systems on Graphical Processing Units Download

Platform 2012, a Many-Core Computing Accelerator for Embedded SoCs: Performance Evaluation of Visual Analytics Applications Download

Platform Characterization for Domain-Specific Computing Download

Platform-independent parallelization of the Lattice Boltzmann method with OpenCL Download

Platform-Specific Optimization and Mapping of Stencil Codes through Refinement Download

Playdoh: A lightweight Python library for distributed computing and optimisation Download Package

PLB-HeC: A Profile-based Load-Balancing algorithm for Heterogeneous CPU-GPU Clusters Download

Plenoptic Rendering With Interactive Performance Using GPUs Download

PlinkGPU: A Framework for GPU Acceleration of Whole Genome Data Analysis Download

PM4Py-GPU: a High-Performance General-Purpose Library for Process Mining Download Package

PMT: Power Measurement Toolkit Download Package

PNG1 triangles for tangent plane continuous surfaces on the GPU Download

PoCL-R: A Scalable Low Latency Distributed OpenCL Runtime Download Package

PoCL-R: An Open Standard Based Offloading Layer for Heterogeneous Multi-Access Edge Computing with Server Side Scalability Download Package

pocl: A Performance-Portable OpenCL Implementation Download

Point Based Approximate Color Bleeding With Cuda Download

Point Based Color Bleeding with CUDA and Caching Download

Point Rendering in CUDA Path Tracer Download

Point Spread Function Estimation of Solar Surface Images with a Cooperative Particle Swarm Optimization on GPUs Download Package

Point to Line Mappings and Other Line Parameterizations not only for Hough Transform Download

Point to point processing of digital images using parallel computing Download

Point-wise Adaptive Filtering for Fast Monte Carlo Noise Reduction Download

Pointer Analysis for Semi-Automatic Code Parallelizers Download

Poisson-Boltzmann model for protein-surface electrostatic interactions and grid-convergence study using the PyGBe code Download Package

Policy-based Tuning for Performance Portability and Library Co-optimization Download

Polly – Polyhedral optimization in LLVM Download

Polly-ACC: Transparent compilation to heterogeneous hardware Download

Polyconvexification of the multi-label optical flow problem Download

 

Brief statistics for this page

Titles: 100

Download open PDFs: 95

Package packages: 25

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: