Papers on hgpu.org (.txt-file)

Performance Evaluation of Quicksort with GPU Dynamic Parallelism for Gene-Expression Quantile Normalization Download

Performance Evaluation of R with Intel Xeon Phi Coprocessor Download Package

Performance Evaluation of Sparse Matrix Multiplication Kernels on Intel Xeon Phi Download

Performance Evaluation of the Intel Many Integrated Core Architecture for 3D Image Reconstruction in Computed Tomography Download

Performance evaluation of the multi-device OpenCL FDTD solver

Performance Evaluation of the NVIDIA GeForce 8800 GTX GPU for Machine Learning Download

Performance Evaluation of the Ocean-Land-Atmosphere Model Using Graphics Processing Units Download

Performance Evaluations of Document-Oriented Databases using GPU and Cache Structure Download

Performance Evaluations of Graph Database using CUDA and OpenMP-Compatible Libraries Download

Performance Exploration of Selected Manually and Automatically Parallelized Codes on GPUs Download

Performance Gains in Conjugate Gradient Computation with Linearly Connected GPU Multiprocessors Download

Performance Impact of Data Layout on the GPU-accelerated IDW Interpolation Download

Performance impact of dynamic parallelism on different clustering algorithms Download

Performance Impact of Memory Channels on Sparse and Irregular Algorithms Download

Performance Improvement of Data Mining in Weka through GPU Acceleration Download Package

Performance Improvement of Multichannel Audio by Graphics Processing Units Download

Performance Improvement of Optical Algorithms on Multicore Platforms Download

Performance Improvement of TOUGH2 Simulation with Graphics Processing Unit Download

Performance improvements for iterative electron tomography reconstruction using graphics processing units (GPUs) Download

Performance improvements of real-time crowd simulations Download

Performance in GPU Architectures: Potentials and Distances Download

Performance modeling and automatic ghost zone optimization for iterative stencil loops on GPUs Download

Performance Modeling and Evaluation of Distributed Deep Learning Frameworks on GPUs Download Package

Performance modeling of atomic additions on GPU scratchpad memory Download

Performance Modeling, Optimization, and Characterization on Heterogeneous Architectures Download

Performance Modelling and Traffic Characterisation of Optical Networks

Performance Modelling of Deep Learning on Intel Many Integrated Core Architectures Download

Performance models for CPU-GPU data transfers Download Package

Performance models for CUDA streams on NVIDIA GeForce series Download

Performance Models for Heterogeneous Iterative Programs Download

Performance Monitoring of Multi-FPGA Systems Download Package

Performance of a code migration for the simulation of supersonic ejector flow to SMP, MIC and GPU using OpenMP, OpenMP+LEO, and OpenACC directives Download

Performance of a GPU-based Direct Summation Algorithm for Computation of Small Angle Scattering Profile Download

Performance of CPU and GPU HPC Architectures for off-design aircraft simulation Download

Performance of FORTRAN and C GPU Extensions for a Benchmark Suite of Fourier Pseudospectral Algorithms Download Package

Performance of GPU for Pricing Financial Derivatives: Convertible Bonds Download

Performance of GTX Titan X GPUs and Code Optimization Download

Performance of Implicit Solver Strategies on GPUs Download

Performance of inverse atomistic scale fracture modeling on GPGPU architectures

Performance of Kepler GTX Titan GPUs and Xeon Phi System Download

Performance of OpenCL Download

Performance of Optical Flow Techniques on Graphics Hardware Download

Performance of PETSc GPU Implementation with Sparse Matrix Storage Schemes Download

Performance Optimisation of Smoothed Particle Hydrodynamics Algorithms for Multi/Many-Core Architectures Download

Performance Optimisations for Heterogeneous Managed Runtime Systems Download

Performance Optimization of 3-D Lattice Boltzmann Flow Solver on a GPU Download

Performance Optimization of Clustering On GPU Download

Performance Optimization of Deep Learning Sparse Matrix Kernels on Intel Max Series GPU Download

Performance Optimization of GPU ELF-Codes

Performance Optimization of Memory Intensive Applications on FPGA Accelerator Download

Performance Optimization of Vision Apps on Mobile Application Processor Download

Performance Optimization using Multimodal Modeling and Heterogeneous GNN Download

Performance Optimization Using Partitioned SpMV on GPUs and Multicore CPUs Download

Performance optimizations for scalable CFD applications on hybrid CPU+MIC heterogeneous computing system with millions of cores Download

Performance portability analysis of SYCL with a classical CG on CPU, GPU, and FPGA Download

Performance Portability and Evaluation of Heterogeneous Components of SeisSol Targeted to Upcoming Intel HPC GPUs Download Package

Performance Portability Challenges for Fortran Applications Download Package

Performance Portability Evaluation for OpenACC on Intel Knights Corner and Nvidia Kepler Download

Performance portability evaluation of blocked stencil computations on GPUs Download Package

Performance Portability in Accelerated Parallel Kernels Download

Performance Portability of a GPU Enabled Factorization with the DAGuE Framework Download

Performance Portability of the Aeras Atmosphere Model to Next Generation Architectures using Kokkos Download

Performance Portability Strategies for Computational Fluid Dynamics (CFD) Applications on HPC Systems Download

Performance portability study of epistasis detection using SYCL on NVIDIA GPU Download

Performance Portability Study of Linear Algebra Kernels in OpenCL Download

Performance portability through machine learning guided kernel selection in SYCL libraries Download Package

Performance portability via C++ PSTL, SYCL, OpenMP, and HIP: the Gaia AVU-GSR case study Download Package

Performance Portability with the Chapel Language Download

Performance Portable GPU Code Generation for Matrix Multiplication Download

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs Download Package

Performance potential for simulating spin models on GPU Download Package

Performance prediction of deep learning applications training in GPU as a service systems Download

Performance Predictions for General-Purpose Computation on GPUs

Performance study of filtered back-projection algorithms implemented on GPUs Download

Performance study of interference on GPU and CPU resources with multiple applications

Performance Study of LU Decomposition on the Programmable GPU Download

Performance study of mapping irregular computations on GPUs Download

Performance Study of Satellite Image Processing on Graphics Processors Unit Using CUDA Download

Performance study of using the Direct Compute API for implementing Support vector machines on GPUs Download Package

Performance study on GPU offloading techniques using the Gauss matrix inverse algorithm Download Package

Performance Testing of GPU-Based Approximate Matching Algorithm on Network Traffic Download

Performance Tradeoff Spectrum of Integer and Floating Point Applications Download

Performance Tradeoff Spectrum of Integer and Floating Point Applications Kernels on Various GPUs Download

Performance Traps in OpenCL for CPUs Download

Performance Tuning for CUDA-Accelerated Neighborhood Denoising Filters Download

Performance Tuning for GPU-Embedded Systems: Machine-Learning-based and Analytical Model-driven Tuning Methodologies Download

Performance Upper Bound Analysis and Optimization of SGEMM on Fermi and Kepler GPUs Download

Performance-Analysis-Based Acceleration of Image Quality Assessment Download

Performance-aware component composition for GPU-based systems Download

Performance-Correctness Challenges in Emerging Heterogeneous Multicore Processors Download

Performance-efficient mechanisms for managing irregularity in throughput processors Download

Performance-Oriented Neural Architecture Search Download

Performance-Portable Many-Core Plasma Simulations: Porting PIConGPU to OpenPower and Beyond Download Package

Performance/power assessment of CNN packages on embedded automotive platforms Download

Performant Automatic BLAS Offloading on Unified Memory Architecture with OpenMP First-Touch Style Data Movement Download

Performant low-order matrix-free finite element kernels on GPU architectures Download Package

Performing DCT8x8 Computation on GPU Using NVIDIA CUDA Technology Download

Performing efficient NURBS modeling operations on the GPU Download

Performing with CUDA Download

PeriPy – A High Performance OpenCL Peridynamics Package Download Package

Brief statistics for this page

Titles: 100

Download open PDFs: 94

Packages: 18

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org