Papers on hgpu.org (.txt-file)

PRAND: GPU accelerated parallel random number generation library: Using most reliable algorithms and applying parallelism of modern GPUs and CPUs Download Package

Precise dynamic analysis for slack elasticity: adding buffering without adding bugs Download

Precise Energy Consumption Measurements of Heterogeneous Artificial Intelligence Workloads Download Package

Precision and Performance Analysis of C Standard Math Library Functions on GPUs Download

Precision and Performance: Floating Point and IEEE 754 Compliance for NVIDIA GPUs Download

Precision-Aware Soft Error Protection for GPUs Download

Precomputed Atmospheric Scattering Download Package

Precomputed compressive sensing for light transport acquisition Download

Precomputed Visibility Cuts for Interactive Relighting with Dynamic BRDFs Download

Preconditioned conjugate gradient solver for structural problems Download

Predictable GPGPU Computing in DNN-Driven Autonomous Systems Download

Predicting GPUDirect Benefits for HPC Workloads Download

Predicting NVIDIA’s Next-Day Stock Price: A Comparative Analysis of LSTM, MLP, ARIMA, and ARIMA-GARCH Models Download

Predicting the Execution Time of a kernel on a specific GPU using PTX code Download

Prediction of Performance and Power Consumption of GPGPU Applications Download Package

Predictive Data Race Detection for GPUs Download

Predictive Lazy Amplification: Synthesis and Rendering of Massive Procedural Scenes in Real Time Download Package

Predictive Modeling and Analysis of OP2 on Distributed Memory GPU Clusters Download

Predictive Runtime Code Scheduling for Heterogeneous Architectures Download

Preemptive Thread Block Scheduling with Online Structural Runtime Prediction for Concurrent GPGPU Kernels Download

Prefiltered Single Scattering Download

Preliminary Experiences with the Uintah Framework on Intel Xeon Phi and Stampede Download

Preliminary Experiments with XKaapi on Intel Xeon Phi Coprocessor Download

Preliminary implementation of two parallel programs for fractal image coding on GPUs

Preliminary implementation of VQ image coding using GPGPU

Preliminary report: Initial evaluation of StdPar implementations on AMD GPUs for HPC Download

Preliminary results of autotuning GEMM kernels for the NVIDIA Kepler architecture-GeForce GTX 680 Download

Preparing Ginkgo for AMD GPUs – A Testimonial on Porting CUDA Code to HIP Download Package

Pretty Good Accuracy in Matrix Multiplication with GPUs Download

Pricing composable contracts on the GP-GPU Download

Pricing of cross-currency interest rate derivatives on Graphics Processing Units Download

Pricing the American Option Using Reconfigurable Hardware Download

Primal Dual Affine Scaling on GPUs Download

Principal Kernel Analysis: A Tractable Methodology to Simulate Scaled GPU Workloads Download

Principles for Automated and Reproducible Benchmarking Download Package

Principles towards Real-Time Simulation of Material Point Method on Modern GPUs Download

Principles, Techniques, and Tools for Explicit and Automatic Parallelization Download Package

Priority-Based Task Management in a GPGPU Megakernel Download

PRISM-PSY: Precise GPU-Accelerated Parameter Synthesis for Stochastic Systems Download

Prius: A Runtime for Hybrid Computing Download

PRNG Random Numbers on GPU Download Package

Probabilistic View-based 3D Curve Skeleton Computation on the GPU Download Package

Probing biomolecular machines with graphics processors Download

Probing the Statistical Validity of the Ductile-to-Brittle Transition in Metallic Nanowires Using GPU Computing Download

Process Time Comparison between GPU and CPU Download

Processing Big Data in Main Memory and on GPU Download

Processing data streams with hard real-time constraints on heterogeneous systems Download

Processing Hard Sphere Collisions on a GPU Using OpenCL Download

Processing Large-scale XML Files on GPGPU Cluster Download

Processing Markov Logic Networks with GPUs Download

Processing MPI Derived Datatypes on Noncontiguous GPU-Resident Data Download

Processing Neocognitron of Face Recognition on High Performance Environment Based on GPU with CUDA Architecture Download

Processing of synthetic Aperture Radar data with GPGPU

Processing OLTP Workloads on Hybrid CPU/GPU Systems Download

Processing Posting Lists Using OpenCL Download

Processing XPath Structural Constraints on GPU Download

Production Floating Point Applications on FPGAs Download

Production Level CFD Code Acceleration for Hybrid Many-Core Architectures Download

Productive and Efficient Computational Science Through Domain-specific Abstractions Download

Productive High Performance Parallel Programming with Auto-tuned Domain-Specific Embedded Languages Download

Productive Performance Engineering for Weather and Climate Modeling with Python Download

Productivity, Portability, Performance: Data-Centric Python Download

Professional CUDA C Programming Download

Profile Util library: A quick and easy way to get MPI, OpenMP and GPU runtime information Download Package

Profile-guided optimization of critical medical imaging algorithms Download

Profiling based Out-of-core Hybrid Method for Large Neural Networks Download

Profiling General Purpose GPU Applications Download

Profiling Heterogeneous Multi-GPU Systems to Accelerate Cortically Inspired Learning Algorithms Download

Profiling High Level Heterogeneous Programs: Using the SPOC GPGPU framework for OCaml Download Package

Profiling of Data-Parallel Processors Download

Program Acceleration in a Heterogeneous Computing Environment Using OpenCL, FPGA, and CPU Download

Program Analysis and Machine Learning based Approach to Predict Power Consumption of CUDA Kernel Download Package

Program optimization carving for GPU computing?

Program Optimization of Array-Intensive SPEC2k Benchmarks on Multithreaded GPU Using CUDA and Brook+

Program Optimization of Stencil Based Application on the GPU-Accelerated System

Program optimization space pruning for a multithreaded GPU Download

Program Optimization Strategies for Data-Parallel Many-Core Processors Download

Program Optimization Study on a 128-Core GPU Download

PROGRAML: A Graph-based Program Representation for Data Flow Analysis and Compiler Optimizations Download Package

ProGraML: Graph-based Deep Learning for Program Optimization and Analysis Download Package

Programmability and Performance Portability Aspects of Heterogeneous Multi-/Manycore Systems Download

Programmability: Design Costs and Payoffs using AMD GPU Streaming Languages and Traditional Multi-Core Libraries Download

Programmable and Scalable Architecture for Graphics Processing Units Download

Programmable shaders for deformation rendering Download

Programming Abstractions and Optimization Techniques for GPU-based Heterogeneous Systems Download Package

Programming and Performance of Graphics Processors in Shock Waves Simulation by Finite Volume Method Download

Programming and Scheduling Model for Supporting Heterogeneous Accelerators in Linux Download Package

Programming Challenges for the Implementation of Numerical Quadrature in Atomic Physics on FPGA and GPU Accelerators

Programming CUDA and OpenCL: A Case Study Using Modern C++ Libraries Download Package

Programming Dense Linear Algebra Kernels on Vectorized Architectures Download

Programming Embedded Manycore: Refinement and Optimizing Compilation of a Parallel Action Language for Hierarchical State Machines Download

Programming finite-difference time-domain for graphics processor units using compute unified device architecture Download

Programming for scientific computing on peta-scale heterogeneous parallel systems Download

Programming framework for clusters with heterogeneous accelerators Download

Programming Frameworks for Distributed Smartphone Computing Download

Programming Future Parallel Architectures with Haskell and Intel ArBB Download

Programming GPUs with C++14 and Just-In-Time Compilation Download

Programming Heterogeneous Systems from an Image Processing DSL Download Package

Programming Heterogeneous Systems with General and Domain-Specific Frameworks Download Package

Programming hybrid systems with implicit memory based synchronization Download Package

Brief statistics for this page

Titles: 100

Download open PDFs: 93

Packages: 21

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
