1173

Papers on hgpu.org (.txt-file)

Architecture-Aware Optimization Targeting Multithreaded Stream Computing Download

Architecture-based Performance Evaluation of Genetic Algorithms on Multi/Many-core Systems Download

Architecture, Design, and Experimental Evaluation of a Lightfield Descriptor Depth Buffer Algorithm on Reconfigurable Logic and on a GPU

Are Very Deep Neural Networks Feasible on Mobile Devices? Download

Arioc: high-throughput read alignment with GPU-accelerated exploration of the seed-and-extend search space Download Package

Aristotle: A Performance Impact Indicator for the OpenCL Kernels Using Local Memory Download Package

ARK: GPU-driven Code Execution for Distributed Deep Learning Download

ARKCoS: Artifact-Suppressed Accelerated Radial Kernel Convolution on the Sphere Download

Array Languages Make Neural Networks Fast Download Package

Array Program Transformation with Loo.py by Example: High-Order Finite Elements Download Package

Array-Oriented Languages and Polyhedral Compilation Download

ART vs. NDK vs. GPU acceleration: A study of performance of image processing algorithms on Android Download

Articulated object tracking by rendering consistent appearance parts Download

Artifact-Free Decompression and Zooming of JPEG Compressed Images with Total Generalized Variation Download

Artifact-Free JPEG Decompression with Total Generalized Variation Download

Artificial Intelligence in Electric Machine Drives: Advances and Trends Download

Artificial neural network computation on graphic process unit Download

Artificial Neural Network Simulation on CUDA Download

ARVO-CL: The OpenCL version of the ARVO package – An efficient tool for computing the accessible surface area and the excluded volume of proteins via analytical equations Download

ASAMgpu V1.0-a moist fully compressible atmospheric model using graphics processing units (GPUs) Download

Aspect-Driven Mixed-Precision Tuning Targeting GPUs Download Package

Aspects of GPU for general purpose high performance computing Download

Assembling large mosaics of electron microscope images using GPU Download

Assembly of finite element methods on graphics processors Download

Assembly-Free Large-Scale Modal Analysis on the GPU Download

Assembly-Free Structural Dynamics On CPU and GPU Download

Assessing Accelerator-Based HPC Reverse Time Migration

Assessing Application Efficiency and Performance Portability in Single-Source Programming for Heterogeneous Parallel Systems Download Package

Assessing Opportunities of SYCL and Intel oneAPI for Biological Sequence Alignment Download Package

Assessing opportunities of SYCL for biological sequence alignment on GPU-based systems Download Package

Assessing the feasibility of OpenCL CPU implementations for agent-based simulations Download Package

Assessing the hardness of SVP algorithms in the presence of CPUs and GPUs Download

Assessing the Impact of Compiler Optimizations on GPUs Reliability Download

Assessing the Performance-Energy Balance of Graphics Processors for Spectral Unmixing Download

Assessment of GPU computational enhancement to a 2D flood model

Assessment of various GPU acceleration strategies in text categorization processing flow Download

Astronomical Photometric Data Reduction Using GPGPU Download

Astrophysical data mining with GPU. A case study: genetic classification of globular clusters Download

Astrophysical Particle Simulations on Heterogeneous CPU-GPU Systems Download

Astrophysical Particle Simulations with Custom GPU Clusters Download

Astrophysical particle simulations with large custom GPU clusters on three continents Download

Astrophysical particle simulations with large custom GPU clusters on three continents Download

Astrophysical Supercomputing with GPUs: Critical Decisions for Early Adopters Download

Astrophysical-oriented Computational multi-Architectural Framework Download

ASW: Accelerating Smith-Waterman Algorithm on Coupled CPU-GPU Architecture Download

AsymML: An Asymmetric Decomposition Framework for Privacy-Preserving DNN Training and Inference Download

Asymptotic Peak Utilisation in Heterogeneous Parallel CPU/GPU Pipelines: A Decentralised Queue Monitoring Strategy Download

Asynchronous Communication for Finite-Difference Simulations on GPU Clusters using CUDA and MPI Download

Asynchronous Communication Schemes for Finite Difference Methods on Multiple GPUs Download

Asynchronous Methods for Deep Reinforcement Learning Download

Asynchronous OpenCL/MPI numerical simulations of conservation laws Download

Asynchronous Parallel Computing Algorithm implemented in 1D Heat Equation with CUDA Download

Asynchronous Parallel Computing Model of Global Motion Estimation with CUDA Download

Asynchronous Task-Based Polar Decomposition on Single Node Manycore Architectures Download

ATI Stream Profiler: a tool to optimize an OpenCL kernel on ATI Radeon GPUs Download

Atmospheric Chemistry Download

Atmospheric turbulence removal using convolutional neural network Download

Atomic-free Irregular Computations on GPUs Download Package

Atos: A Task-Parallel GPU Dynamic Scheduling Framework for Dynamic Irregular Computations Download

Attack Signature Matching using Graphics Processors in High-Performance Intrusion Detection Systems Download

Attaining system performance points: revisiting the end-to-end argument in system design for heterogeneous many-core systems

Attention-based NMT Models as Feature Functions in Phrase-based SMT Download

ATTILA: a cycle-level execution-driven simulator for modern GPU architectures Download

Audiovisual Voice Activity Detection and Localization of Simultaneous Speech Sources Download

Augmented reality live-action compositing Download

Augmented reality usage for prototyping speed up Download

Augmenting Operating Systems With the GPU Download Package

Augur: a Modeling Language for Data-Parallel Probabilistic Inference Download

Aurally and visually enhanced audio search with soundtorch

AUTO-GC: Automatic translation of data mining applications to GPU clusters

Auto-Generation and Auto-Tuning of 3D Stencil Codes on GPU Clusters Download

Auto-Generation and Auto-Tuning of 3D Stencil Codes on Homogeneous and Heterogeneous GPU Clusters Download

Auto-Generation of Parallel Finite-Differencing Code for MPI, TBB and CUDA Download

Auto-optimization of a Feature Selection Algorithm Download

Auto-SpMV: Automated Optimizing SpMV Kernels on GPU Download

Auto-tunable GPU BLAS Download

Auto-tunable GPU BLAS (thesis) Download Package

Auto-tuned OpenCL kernel co-execution in OmpSs for heterogeneous systems Download

Auto-tuning 3-D FFT library for CUDA GPUs

Auto-tuning a High-Level Language Targeted to GPU Codes Download

Auto-tuning a LOFAR radio astronomy pipeline in JavaCL Download

Auto-Tuning CUDA Parameters for Sparse Matrix-Vector Multiplication on GPUs Download

Auto-Tuning Dedispersion for Many-Core Accelerators Download

Auto-tuning Dense Matrix Multiplication for GPGPU with Cache

Auto-tuning Dense Vector and Matrix-Vector Operations for Fermi GPUs Download

Auto-tuning Hybrid CPU-GPU Execution of Algorithmic Skeletons in SkePU Download

Auto-tuning interactive ray tracing using an analytical GPU architecture model Download

Auto-tuning of fast fourier transform on graphics processors Download

Auto-Tuning of Level 1 and Level 2 BLAS for GPUs Download

Auto-tuning on the macro scale: high level algorithmic auto-tuning for scientific applications Download

Auto-tuning Shallow water simulations on GPUs Download

Auto-tuning SkePU: a multi-backend skeleton programming framework for multi-GPU systems Download Package

Auto-tuning Streamed Applications on Intel Xeon Phi Download Package

Auto-Tunning of Data Communication on Heterogeneous Systems Download

Auto-Vectorizing a Large-scale Production Unstructured-mesh CFD Application Download

AutoDDL: Automatic Distributed Deep Learning with Asymptotically Optimal Communication Download Package

AutoFreeze: Automatically Freezing Model Blocks to Accelerate Fine-tuning Download Package

AutoMat – Automatic Differentiation for Generalized Standard Materials on GPUs Download

Automated and interactive approaches for optimal surface finding based segmentation of medical image data Download

Automated and parallel code generation for finite-differencing stencils with arbitrary data types Download

 

Brief statistics for this page

Titles: 100

Doubles=1

Download open PDFs: 92

Package packages: 16

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: