
Papers on hgpu.org (.txt-file)

Architecting graphics processors for non-graphics compute acceleration Download

Architecting SOT-RAM Based GPU Register File Download

Architectural Analysis and Performance Characterization of NVIDIA GPUs using Microbenchmarking Download

Architectural Comparisons for a Quantum Monte Carlo Application Download

Architectural Considerations for Compiler-guided Unroll-and-Jam of CUDA Kernels Download

Architectural Exploration and Scheduling Methods for Coarse Grained Reconfigurable Arrays Download

Architectural explorations for streaming accelerators with customized memory layouts Download

Architectural improvements and 28 nm FPGA implementation of the APEnet+ 3D Torus network for hybrid HPC systems Download

Architectural Principles and Experimentation of Distributed High Performance Virtual Clusters Download

Architectural Support for the Stream Execution Model on General-Purpose Processors Download

Architectural Support for Virtual Memory in GPUs Download

Architecture Comparisons between Nvidia and ATI GPUs: Computation Parallelism and Data Communications Download

Architecture of the real-time target detection processing in an airborne hyperspectral demonstrator system

Architecture-Adaptive Code Variant Tuning Download Package

Architecture- and Workload-Aware Heterogeneous Algorithms for Sparse Matrix Vector Multiplication Download

Architecture-Aware Algorithms and Software for Peta and Exascale Computing Download

Architecture-Aware Mapping and Optimization on a 1600-Core GPU Download

Architecture-Aware Mapping and Optimization on Heterogeneous Computing Systems Download

Architecture-Aware Optimization on a 1600-core Graphics Processor Download

Architecture-Aware Optimization Targeting Multithreaded Stream Computing Download

Architecture-based Performance Evaluation of Genetic Algorithms on Multi/Many-core Systems Download

Architecture, Design, and Experimental Evaluation of a Lightfield Descriptor Depth Buffer Algorithm on Reconfigurable Logic and on a GPU

Are Very Deep Neural Networks Feasible on Mobile Devices? Download

Arioc: high-throughput read alignment with GPU-accelerated exploration of the seed-and-extend search space Download Package

Aristotle: A Performance Impact Indicator for the OpenCL Kernels Using Local Memory Download Package

ARK: GPU-driven Code Execution for Distributed Deep Learning Download

ARKCoS: Artifact-Suppressed Accelerated Radial Kernel Convolution on the Sphere Download

Array Languages Make Neural Networks Fast Download Package

Array Program Transformation with Loo.py by Example: High-Order Finite Elements Download Package

Array-Oriented Languages and Polyhedral Compilation Download

ART vs. NDK vs. GPU acceleration: A study of performance of image processing algorithms on Android Download

Articulated object tracking by rendering consistent appearance parts Download

Artifact-Free Decompression and Zooming of JPEG Compressed Images with Total Generalized Variation Download

Artifact-Free JPEG Decompression with Total Generalized Variation Download

Artificial Intelligence in Electric Machine Drives: Advances and Trends Download

Artificial neural network computation on graphic process unit Download

Artificial Neural Network Simulation on CUDA Download

ARVO-CL: The OpenCL version of the ARVO package – An efficient tool for computing the accessible surface area and the excluded volume of proteins via analytical equations Download

ASAMgpu V1.0 – a moist fully compressible atmospheric model using graphics processing units (GPUs) Download

Aspect-Driven Mixed-Precision Tuning Targeting GPUs Download Package

Aspects of GPU for general purpose high performance computing Download

Assembling large mosaics of electron microscope images using GPU Download

Assembly of finite element methods on graphics processors Download

Assembly-Free Large-Scale Modal Analysis on the GPU Download

Assembly-Free Structural Dynamics On CPU and GPU Download

Assessing Accelerator-Based HPC Reverse Time Migration

Assessing Application Efficiency and Performance Portability in Single-Source Programming for Heterogeneous Parallel Systems Download Package

Assessing Intel OneAPI capabilities and cloud-performance for heterogeneous computing Download

Assessing Opportunities of SYCL and Intel oneAPI for Biological Sequence Alignment Download Package

Assessing opportunities of SYCL for biological sequence alignment on GPU-based systems Download Package

Assessing the feasibility of OpenCL CPU implementations for agent-based simulations Download Package

Assessing the hardness of SVP algorithms in the presence of CPUs and GPUs Download

Assessing the Impact of Compiler Optimizations on GPUs Reliability Download

Assessing the Performance-Energy Balance of Graphics Processors for Spectral Unmixing Download

Assessment of GPU computational enhancement to a 2D flood model

Assessment of various GPU acceleration strategies in text categorization processing flow Download

Astronomical Photometric Data Reduction Using GPGPU Download

Astrophysical data mining with GPU. A case study: genetic classification of globular clusters Download

Astrophysical Particle Simulations on Heterogeneous CPU-GPU Systems Download

Astrophysical Particle Simulations with Custom GPU Clusters Download

Astrophysical particle simulations with large custom GPU clusters on three continents Download

Astrophysical particle simulations with large custom GPU clusters on three continents Download

Astrophysical Supercomputing with GPUs: Critical Decisions for Early Adopters Download

Astrophysical-oriented Computational multi-Architectural Framework Download

ASW: Accelerating Smith-Waterman Algorithm on Coupled CPU-GPU Architecture Download

AsymML: An Asymmetric Decomposition Framework for Privacy-Preserving DNN Training and Inference Download

Asymptotic Peak Utilisation in Heterogeneous Parallel CPU/GPU Pipelines: A Decentralised Queue Monitoring Strategy Download

Asynchronous Communication for Finite-Difference Simulations on GPU Clusters using CUDA and MPI Download

Asynchronous Communication Schemes for Finite Difference Methods on Multiple GPUs Download

Asynchronous Methods for Deep Reinforcement Learning Download

Asynchronous OpenCL/MPI numerical simulations of conservation laws Download

Asynchronous Parallel Computing Algorithm implemented in 1D Heat Equation with CUDA Download

Asynchronous Parallel Computing Model of Global Motion Estimation with CUDA Download

Asynchronous Task-Based Polar Decomposition on Single Node Manycore Architectures Download

ATI Stream Profiler: a tool to optimize an OpenCL kernel on ATI Radeon GPUs Download

Atmospheric Chemistry Download

Atmospheric turbulence removal using convolutional neural network Download

Atomic-free Irregular Computations on GPUs Download Package

Atos: A Task-Parallel GPU Dynamic Scheduling Framework for Dynamic Irregular Computations Download

Attack Signature Matching using Graphics Processors in High-Performance Intrusion Detection Systems Download

Attaining system performance points: revisiting the end-to-end argument in system design for heterogeneous many-core systems

Attention-based NMT Models as Feature Functions in Phrase-based SMT Download

ATTILA: a cycle-level execution-driven simulator for modern GPU architectures Download

Audiovisual Voice Activity Detection and Localization of Simultaneous Speech Sources Download

Augmented reality live-action compositing Download

Augmented reality usage for prototyping speed up Download

Augmenting Operating Systems With the GPU Download Package

Augur: a Modeling Language for Data-Parallel Probabilistic Inference Download

Aurally and visually enhanced audio search with soundtorch

AUTO-GC: Automatic translation of data mining applications to GPU clusters

Auto-Generation and Auto-Tuning of 3D Stencil Codes on GPU Clusters Download

Auto-Generation and Auto-Tuning of 3D Stencil Codes on Homogeneous and Heterogeneous GPU Clusters Download

Auto-Generation of Parallel Finite-Differencing Code for MPI, TBB and CUDA Download

Auto-optimization of a Feature Selection Algorithm Download

Auto-SpMV: Automated Optimizing SpMV Kernels on GPU Download

Auto-tunable GPU BLAS Download

Auto-tunable GPU BLAS (thesis) Download Package

Auto-tuned OpenCL kernel co-execution in OmpSs for heterogeneous systems Download

Auto-tuning 3-D FFT library for CUDA GPUs

Auto-tuning a High-Level Language Targeted to GPU Codes Download


Brief statistics for this page

Titles: 100

Duplicates: 1

Download open PDFs: 92

Packages: 13

* * *


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
