1173

Papers on hgpu.org (.txt-file)

Automatic NUMA Characterization using Cbench Download Package

Automatic Online Tuning (AutoTune): Fully Extended Analysis Download

Automatic OpenCL code generation for multi-device heterogeneous architectures Download

Automatic OpenCL Device Characterization: Guiding Optimized Kernel Design Download

Automatic OpenCL Task Adaptation for Heterogeneous Architectures Download

Automatic Optimization of In-Flight Memory Transactions for GPU Accelerators based on a Domain-Specific Language for Medical Imaging Download Package

Automatic Optimization of OpenCL-Based Stencil Codes for FPGAs and Its Evaluation Download

Automatic Optimization of Thread Mapping for a GPGPU Programming Framework Download

Automatic Parallelization for GPUs Download

Automatic parallelization for graphics processing units Download

Automatic Parallelization for Heterogeneous Embedded Systems Download

Automatic Parallelization of a Gap Model using Java and OpenCL Download

Automatic Parallelization of Tiled Loop Nests with Enhanced Fine-Grained Parallelism on GPUs Download

Automatic Parallelization of Tiled Stencil Loop Nests on GPUs Download

Automatic Parallelization: Executing Sequential Programs on a Task-Based Parallel Runtime Download Package

Automatic Performance Optimisation of Parallel Programs for GPUs via Rewrite Rules Download

Automatic Performance Optimization in ViennaCL for GPUs Download Package

Automatic Performance Optimization on Heterogeneous Computer Systems using Manycore Coprocessors Download

Automatic Performance Tuning of Pipeline Patterns for Heterogeneous Parallel Architectures Download

Automatic Performance Tuning of Stencil Computations on Graphics Processing Units Download

Automatic Point Target Detection for Interactive Visual Analysis of SAR Images Download

Automatic Pose Estimation for Range Images on the GPU Download

Automatic program analysis for data parallel kernels Download

Automatic program parallelization for multicore processors Download

Automatic Resource-Constrained Static Task Parallelization Download

Automatic run-time mapping of polyhedral computations to heterogeneous devices with memory-size restrictions Download

Automatic safety proofs for asynchronous memory operations Download

Automatic Scan Parallelization in OpenMP Download

Automatic scanning of nuclear emulsions with wide-angle acceptance for nuclear fragment detection Download

Automatic Scheduling of Compute Kernels Across Heterogeneous Architectures Download

Automatic Selection of Sparse Matrix Representation on GPUs Download

Automatic shader level of detail

Automatic SIMD Code Generation Download

Automatic Skeleton-Based Compilation through Integration with an Algorithm Classification Download

Automatic Software Synthesis from High-Level ForSyDe Models Targeting Massively Parallel Processors Download

Automatic source code adaptation for heterogeneous platforms Download

Automatic Synthesis of Heterogeneous CPU-GPU Embedded Applications from a UML Profile Download

Automatic Termination Analysis for GPU Kernels Download

Automatic Test Case Reduction for OpenCL Download

Automatic test case reduction of randomly generated OpenCL kernels Download

Automatic transformation and optimization of applications on GPUs and GPU clusters Download

Automatic Translation of CUDA to OpenCL and Comparison of Performance Optimizations on GPUs Download

Automatic tuning matrix multiplication performance on graphics hardware Download

Automatic Tuning of Local Memory Use on GPGPUs Download

Automatic Virtualization of Accelerators Download

Automatically Exploiting the Memory Hierarchy of GPUs through Just-in-Time Compilation Download Package

Automatically generating and tuning GPU code for sparse matrix-vector multiplication from a high-level representation Download

Automatically Generating Efficient Simulation Codes on GPUs from Partial Differential Equations Download

Automatically Harnessing Sparse Acceleration Download

Automatically Selecting Profitable Thread Block Sizes Using Machine Learning Download

Automatically translating a general purpose C++ image processing library for GPUs Download

Automatically Tuned Dense Linear Algebra for Multicore+GPU Download Package

Automatically Tuning Sparse Matrix-Vector Multiplication for GPU Architectures

Automating a Labour Performance Measurement and Risk Assessment: An Evaluation of Methods for a Computer Vision based System Download

Automating elimination of idle functions by run-time reconfiguration Download

Automating Energy-Efficient GPU Kernel Generation: A Fast Search-Based Compilation Approach Download

Automating GPU computing in MATLAB

Automating Heterogeneous Parallelism in Numerical Differential Equations Download Package

Automating the Last-Mile for High Performance Dense Linear Algebra Download

AutOMP: An Automatic OpenMP Parallelization Generator for Variable-Oriented High-Performance Scientific Codes Download

AutoParBench: A Unified Test Framework for OpenMP-based Parallelizers Download Package

AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning Download

Autotuning CUDA Compiler Parameters for Heterogeneous Applications using the OpenTuner Framework Download Package

Autotuning CUDA: Applying NLP Techniques to LS-CAT Download

Autotuning for Automatic Parallelization on Heterogeneous Systems Download

Autotuning GEMMs for Fermi Download Package

Autotuning GPU Kernels via Static and Predictive Analysis Download

Autotuning of Pattern Runtimes for Accelerated Parallel Systems Download

Autotuning OpenACC Work Distribution via Direct Search Download Package

Autotuning OpenCL Workgroup Size for Stencil Patterns Download

Autotuning Programs with Algorithmic Choice Download Package

Autotuning Stencil-Based Computations on GPUs Download Package

Autotuning Stencils Codes with Algorithmic Skeletons Download Package

Autotuning Tensor Contraction Computations on GPUs Download

Autotuning Wavefront Abstractions for Heterogeneous Architectures Download

Autotuning Wavefront Patterns for Heterogeneous Architectures Download

Autotuning, Code Generation and Optimizing Compiler Technology for GPUs Download

Auxiliary Image Regularization for Deep CNNs with Noisy Labels Download

AvA: Accelerated Virtualization of Accelerators Download Package

AVEC: Accelerator Virtualization in Cloud-Edge Computing for Deep Learning Libraries Download

AVSS2011 demo session: GPU enabled Smart Video Node Download

AVX-512 extension to OpenQCD 1.6 Download Package

AXC: A new format to perform the SpMV oriented to Intel Xeon Phi architecture in OpenCL Download

Axel: a heterogeneous cluster with FPGAs and GPUs Download

AZP: Automatic Specialization for Zero Values in Gaming Applications Download

b-Bit Minwise Hashing in Practice: Large-Scale Batch and Online Learning and Using GPUs for Fast Preprocessing with Simple Hash Functions Download

B-CALM: An open-source GPU-based 3D-FDTD with multi-pole dispersion for plasmonics Download Package

B-Calm: an Open-Source Multi-Gpu-Based 3D-FDTD with Multi-Pole Dispersion for Plasmonics Download Package

Back Ground Subtraction Algorithm For Moving Object Detection In FPGA Download

Backpropagation Training for Fisher Vectors within Neural Networks Download

BaCO: A Fast and Portable Bayesian Compiler Optimization Framework Download Package

Bacon: A GPU Programming System With Just in Time Specialization Download Package

Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs Download Package

Balancing locality and concurrency: solving sparse triangular systems on GPUs Download

Balancing Tracking Granularity and Parallelism in Many-Task Systems: The Horizons Approach Download Package

Bamboo: Automatic Translation of MPI Source into a Latency-Tolerant Form Download Package

Bandicoot: C++ Library for GPU Linear Algebra and Scientific Computing Download Package

Bandwidth intensive 3-D FFT kernel for GPUs using CUDA

Bandwidth Reduction Through Multithreaded Compression of Seismic Images Download

Bandwidth Requirements of GPU Architectures Download

 

Brief statistics for this page

Titles: 100

Download open PDFs: 96

Package packages: 24

Recent source codes

* * *

* * *

HGPU group © 2010-2025 hgpu.org

All rights belong to the respective authors

Contact us:

contact@hpgu.org