Papers on hgpu.org

Analyzing Locality of Memory References in GPU Architectures Download

Analyzing Memory Accesses for Performance and Correctness of Parallel Programs Download

Analyzing Optimization Techniques for Power Efficiency on Heterogeneous Platforms Download

Analyzing Password Strength and Efficient Password Cracking Download

Analyzing program flow within a many-kernel OpenCL application Download

Analyzing Resource Utilization in an HPC System: A Case Study of NERSC’s Perlmutter Download

Analyzing Soft-Error Vulnerability on GPGPU Microarchitecture Download

Analyzing the CUDA Applications with its Latency and Bandwidth Tolerance Download

Analyzing throughput of GPGPUs exploiting within-die core-to-core frequency variation

Analyzing Use of OpenCL on the Cell Broadband Engine and a Proposal for OpenCL Extensions Download

Anatomizing Deep Learning Inference in Web Browsers Download Package

Anatomy Of High-Performance Deep Learning Convolutions On SIMD Architectures Download Package

Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs Download Package

Anatomy of High-Performance Many-Threaded Matrix Multiplication Download Package

Android Malware Classification Using Parallelized Machine Learning Methods Download

ANGHABENCH: a Suite with One Million Compilable C Benchmarks for Code-Size Reduction Download Package

Animating physically based explosions in real-time

Animation of Orthogonal Texture Patterns for Vector Field Visualization Download

Anisotropic interfacial tension, contact angles, and line tensions: A graphics-processing-unit-based Monte Carlo study of the Ising model Download

Anisotropic Kuwahara Filtering on the GPU

Anisotropic mesh coarsening and refinement on GPU architecture Download

Anisotropic noise Download

Anomalous behaviour detection using spatiotemporal oriented energies, subset inclusion histogram comparison and event-driven processing Download

Anomalous metastability in a temperature-driven transition Download

Anomalous Structure and Scaling of Ring Polymer Brushes Download

Ansor: Generating High-Performance Tensor Programs for Deep Learning Download

Anti-parallel Patterns in Fine-grain Data-parallel Programs Download

ANTS2 package: simulation and experimental data processing for Anger camera type detectors Download Package

AnyHLS: High-Level Synthesis with Partial Evaluation Download Package

AnySeq/GPU: A Novel Approach for Faster Sequence Alignment on GPUs Download Package

AnySL: efficient and portable shading for ray tracing Download Package

Anytime Algorithms for GPU Architectures Download

APACE: AlphaFold2 and advanced computing as a service for accelerated discovery in biophysics Download Package

APEnet+: a 3D toroidal network enabling Petaflops scale Lattice QCD simulations on commodity clusters Download

APEnet+: high bandwidth 3D torus direct network for petaflops scale commodity clusters Download

APHOG: A Framework for Fast Object Detection Using Histograms of Oriented Gradients Download

API-Compiling for Image Hardware Accelerators Download

APL on GPUs: A TAIL from the Past, Scribbled in Futhark Download Package

APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores Download Package

APOGEE: adaptive prefetching on GPUs for energy efficiency Download

Apple Silicon Performance in Scientific Computing Download

Applicability of GPU Computing for Efficient Merge in In-Memory Databases Download

Application level energy measurements and models for hybrid platform with accelerators Download

Application of Assembly of Finite Element Methods on Graphics Processors for Real-Time Elastodynamics Download

Application of Deep-Learning to Compiler-Based Graphs Download

Application of GPGPU for Acceleration of Short DNA Sequence Alignment in Unipro UGENE Project Download Package

Application of GPU Computing to Some Urban Traffic Problems Download

Application of GPU Smooth Particle Hydrodynamics: Wave Runup and Overtopping on Composite Slopes Download Package

Application of GPUs for the Calculation of Two Point Correlation Functions in Cosmology Download Package

Application of Graphics Processing Units to Search Pipeline for Gravitational Waves from Coalescing Binaries of Compact Objects Download

Application of performance portability solutions for GPUs and many-core CPUs to track reconstruction kernels Download

Application of the Characteristic Basis Function Method using CUDA Download

Application of the Mean Field Methods to MRF Optimization in Computer Vision Download

Application of the OpenCL API for Implementation of the NIPALS Algorithm for Principal Component Analysis of Large Data Sets Download

Application Performance Profiling on Intel GPUs with Oneprof and Onetrace Download Package

Application Synthesis and Optimization on Heterogeneous Parallel Processing Systems Download

Application-guided tool development for architecturally diverse computation Download

Application-independent accurate mouse placements on surfaces of arbitrary geometry Download

Applications of Deep Neural Networks Download Package

Applications of Linux-Based QT-CUDA Parallel Architecture Download

Applications of Many-Core Technologies to On-line Event Reconstruction in High Energy Physics Experiments Download

Applications Performance on GPGPUs with the Fermi Architecture Download

Applying Contact Angle to a Two-Dimensional Smoothed Particle Hydrodynamics (SPH) model on a Graphics Processing Unit (GPU) Platform Download

Applying Genetic Algorithms to Tune Heterogeneous Platform Configurations Download

Applying GPU Dynamic Parallelism to High-Performance Normalization of Gene Expressions Download

Applying graphics processor acceleration in a software defined radio prototyping environment Download

Applying Object Oriented Design Patterns to CUDA based Pyramidal Image Blending – An Experience Download

Applying OOC Techniques in the Reduction to Condensed Form for Very Large Symmetric Eigenproblems on GPUs Download

Applying software-managed caching and CPU/GPU task scheduling for accelerating dynamic workloads Download Package

Applying Source Level Auto-Vectorization to Aparapi Java Download

Applying the “Simple Accelerator Modelling in MATLAB” (SAMM) Code to High Luminosity LHC Upgrade Download

Applying the Midas Touch of Reproducibility to High-Performance Computing Download

Applying the Parallel GPU Model to Radiation Therapy Treatment Download

Approaches for parallelizing reductions on modern GPUs

Approaches for the Parallelization of Software Implementation of Integer Multiplication Download

Approximate Belief Propagation by Hierarchical Averaging of Outgoing Messages Download

Approximate Dynamic Programming and Neural Networks on Game Hardware Download

Approximate dynamic programming with post-decision states as a solution method for dynamic economic models Download

Approximate Principal Direction Trees Download

Approximate Similarity Search for Online Multimedia Services on Distributed CPU-GPU Platforms Download

Approximate Subdivision Surface Evaluation in the Language of Linear Algebra Download

Approximation of BEM matrices using GPGPUs Download

Approximation of Loop Subdivision Surfaces for Fast Rendering

Approximative inference for multivariate functional data on massively parallel processors Download

APPy: Annotated Parallelism for Python on GPUs Download

APTCC: Auto Parallelizing Translator From C To CUDA Download

APUNet: Revitalizing GPU as Packet Processing Accelerator Download

AQsort: Scalable Multi-Array In-Place Sorting with OpenMP Download Package

AQUAgpusph, a free 3D SPH solver accelerated with OpenCL Download

Aquila 2.0: Software Architecture for Cognitive Robotics Download Package

Aquila: An Open-Source GPU-Accelerated Toolkit for Cognitive Robotics Research Download Package

Arax: a runtime framework for decoupling applications from heterogeneous accelerators Download Package

Arbitrarily large iterative tomographic reconstruction on multiple GPUs using the TIGRE toolbox Download Package

Arbitrary dimension Reed-Solomon coding and decoding for extended RAID on GPUs Download

Arbitrary-Precision Arithmetics on the GPU Download

ArborX: A Performance Portable Search Library Download Package

ARC: Adaptive Ray-tracing with CUDA, a New Ray Tracing Code for Parallel GPUs Download

ArchesWeather: An efficient AI weather forecasting model at 1.5° resolution Download

Architecting an LTE Base Station with Graphics Processing Units Download

Architecting graphics processors for non-graphics compute acceleration Download

Brief statistics for this page

Titles: 100

Open PDFs (titles marked "Download"): 95

Packages (titles marked "Package"): 24

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
