Papers on hgpu.org (.txt-file)

Performance Optimization of 3-D Lattice Boltzmann Flow Solver on a GPU Download

Performance Optimization of Clustering On GPU Download

Performance Optimization of Deep Learning Sparse Matrix Kernels on Intel Max Series GPU Download

Performance Optimization of GPU ELF-Codes

Performance Optimization of Memory Intensive Applications on FPGA Accelerator Download

Performance Optimization of Vision Apps on Mobile Application Processor Download

Performance Optimization using Multimodal Modeling and Heterogeneous GNN Download

Performance Optimization Using Partitioned SpMV on GPUs and Multicore CPUs Download

Performance optimizations for scalable CFD applications on hybrid CPU+MIC heterogeneous computing system with millions of cores Download

Performance portability analysis of SYCL with a classical CG on CPU, GPU, and FPGA Download

Performance Portability and Evaluation of Heterogeneous Components of SeisSol Targeted to Upcoming Intel HPC GPUs Download Package

Performance Portability Challenges for Fortran Applications Download Package

Performance Portability Evaluation for OpenACC on Intel Knights Corner and Nvidia Kepler Download

Performance portability evaluation of blocked stencil computations on GPUs Download Package

Performance Portability in Accelerated Parallel Kernels Download

Performance Portability of a GPU Enabled Factorization with the DAGuE Framework Download

Performance Portability of the Aeras Atmosphere Model to Next Generation Architectures using Kokkos Download

Performance Portability Strategies for Computational Fluid Dynamics (CFD) Applications on HPC Systems Download

Performance portability study of epistasis detection using SYCL on NVIDIA GPU Download

Performance Portability Study of Linear Algebra Kernels in OpenCL Download

Performance portability through machine learning guided kernel selection in SYCL libraries Download Package

Performance portability via C++ PSTL, SYCL, OpenMP, and HIP: the Gaia AVU-GSR case study Download Package

Performance Portability with the Chapel Language Download

Performance Portable GPU Code Generation for Matrix Multiplication Download

Performance Portable Gradient Computations Using Source Transformation Download

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs Download Package

Performance potential for simulating spin models on GPU Download Package

Performance prediction of deep learning applications training in GPU as a service systems Download

Performance Predictions for General-Purpose Computation on GPUs

Performance study of filtered back-projection algorithms implemented on GPUs Download

Performance study of interference on GPU and CPU resources with multiple applications

Performance Study of LU Decomposition on the Programmable GPU Download

Performance study of mapping irregular computations on GPUs Download

Performance Study of Satellite Image Processing on Graphics Processors Unit Using CUDA Download

Performance study of using the Direct Compute API for implementing Support vector machines on GPUs Download Package

Performance study on GPU offloading techniques using the Gauss matrix inverse algorithm Download Package

Performance Testing of GPU-Based Approximate Matching Algorithm on Network Traffic Download

Performance Tradeoff Spectrum of Integer and Floating Point Applications Download

Performance Tradeoff Spectrum of Integer and Floating Point Applications Kernels on Various GPUs Download

Performance Traps in OpenCL for CPUs Download

Performance Tuning for CUDA-Accelerated Neighborhood Denoising Filters Download

Performance Tuning for GPU-Embedded Systems: Machine-Learning-based and Analytical Model-driven Tuning Methodologies Download

Performance Upper Bound Analysis and Optimization of SGEMM on Fermi and Kepler GPUs Download

Performance-Analysis-Based Acceleration of Image Quality Assessment Download

Performance-aware component composition for GPU-based systems Download

Performance-Correctness Challenges in Emerging Heterogeneous Multicore Processors Download

Performance-efficient mechanisms for managing irregularity in throughput processors Download

Performance-Oriented Neural Architecture Search Download

Performance-Portable Many-Core Plasma Simulations: Porting PIConGPU to OpenPower and Beyond Download Package

Performance/power assessment of CNN packages on embedded automotive platforms Download

Performant Automatic BLAS Offloading on Unified Memory Architecture with OpenMP First-Touch Style Data Movement Download

Performant low-order matrix-free finite element kernels on GPU architectures Download Package

Performant Unified GPU Kernels for Portable Singular Value Computation Across Hardware and Precision Download

Performing DCT8x8 Computation on GPU Using NVIDIA CUDA Technology Download

Performing efficient NURBS modeling operations on the GPU Download

Performing with CUDA Download

PeriPy – A High Performance OpenCL Peridynamics Package Download Package

permGPU: Using graphics processing units in RNA microarray association studies Download Package

Permutation Index and GPU to Solve efficiently Many Queries Download

Persistent Kernels for Iterative Memory-bound GPU Applications Download Package

Persistent RNNs: Stashing Recurrent Weights On-Chip Download Package

Perturbation Functions in Computer Graphics Download

Petaflop biofluidics simulations on a two million-core system Download

Petascale Application of a Coupled CPU-GPU Algorithm for Simulation and Analysis of Multiphase Flow Solutions in Porous Medium Systems Download

Petascale computations for Large-scale Atomic and Molecular collisions Download

Petascale Direct Numerical Simulation of Blood Flow on 200K Cores and Heterogeneous Architectures Download

Petascale elliptic solvers for anisotropic PDEs on GPU clusters Download

Petascale turbulence simulation using a highly parallel fast multipole method Download

Petascale visualization: Approaches and initial results Download

PFAC Library: GPU-based string matching algorithm Download Package

PFunc: modern task parallelism for modern high performance computing Package

PG-PuReMD: A Parallel-GPU Reactive Molecular Dynamics Package Download

PGEM: Preemptive GPGPU Execution Model for Runtime Engines Download Package

Pgx: Hardware-accelerated parallel game simulation for reinforcement learning Download Package

Phase Aware Memory Scheduling Download

Phase Based Volume Registration on the GPU with Application to Quantitative MRI Download

Phase Based Volume Registration Using CUDA Download

Phase diagram and critical behavior of the square-lattice Ising model with competing nearest- and next-nearest-neighbor interactions Download

Phase Transition in 3d Heisenberg Spin Glasses with Strong Random Anisotropies, through a Multi-GPU Parallelization Download

phiGEMM: a CPU-GPU library for porting Quantum ESPRESSO on hybrid systems Download Package

Phoenix: A Runtime Environment for High Performance Computing on Chip Multiprocessors Download

Photon mapping on programmable graphics hardware Download

Physical and graphical effects in OpenCL by example

Physical modeling and high-performance GPU computing for characterization, interception, and disruption of hazardous near-Earth objects Download

Physically Based Rendering: Implementation of Path Tracer Download

Physically-Based Interactive Flow Visualization Based on Schlieren and Interferometry Experimental Techniques Download

Physically-based interactive schlieren flow visualization Download

Physically-based painting style 3D image synthesis using GPU

Physically-Based Sound Synthesis on GPUs Download

Physically-based visual simulation on graphics hardware Download

Physics and Computing Performance of the Exa.TrkX TrackML Pipeline Download Package

Physis: An Implicitly Parallel Programming Model for Stencil Computations on Large-Scale GPU-Accelerated Supercomputers Download Package

Piccolo: building fast, distributed programs with partitioned tables Download

PIConGPU: A Fully Relativistic Particle-in-Cell Code for a GPU Cluster Download

PIConGPU: Predictive Simulations of Laser-Particle Accelerators with Manycore Hardware Download Package

Piecewise Tri-linear Contouring for Multi-material Volumes Download

PIGEON: Optimizing CUDA Code Generator for End-to-End Training and Inference of Relational Graph Neural Networks Download

Piko: A Design Framework for Programmable Graphics Pipelines Download

PILC: Practical Image Lossless Compression with an End-to-end GPU Oriented Neural Framework Download

PipeCNN: An OpenCL-Based FPGA Accelerator for Large-Scale Convolution Neuron Networks Download Package

 

Brief statistics for this page

Titles: 100

Download open PDFs: 94

Packages: 24

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors
