1173

Papers on hgpu.org (.txt-file)

Automatic Parallelization of Tiled Loop Nests with Enhanced Fine-Grained Parallelism on GPUs Download

Automatic Parallelization of Tiled Stencil Loop Nests on GPUs Download

Automatic Parallelization: Executing Sequential Programs on a Task-Based Parallel Runtime Download Package

Automatic Performance Optimisation of Parallel Programs for GPUs via Rewrite Rules Download

Automatic Performance Optimization in ViennaCL for GPUs Download Package

Automatic Performance Optimization on Heterogeneous Computer Systems using Manycore Coprocessors Download

Automatic Performance Tuning of Pipeline Patterns for Heterogeneous Parallel Architectures Download

Automatic Performance Tuning of Stencil Computations on Graphics Processing Units Download

Automatic Point Target Detection for Interactive Visual Analysis of SAR Images Download

Automatic Pose Estimation for Range Images on the GPU Download

Automatic program analysis for data parallel kernels Download

Automatic program parallelization for multicore processors Download

Automatic Resource-Constrained Static Task Parallelization Download

Automatic run-time mapping of polyhedral computations to heterogeneous devices with memory-size restrictions Download

Automatic safety proofs for asynchronous memory operations Download

Automatic Scan Parallelization in OpenMP Download

Automatic scanning of nuclear emulsions with wide-angle acceptance for nuclear fragment detection Download

Automatic Scheduling of Compute Kernels Across Heterogeneous Architectures Download

Automatic Selection of Sparse Matrix Representation on GPUs Download

Automatic shader level of detail

Automatic SIMD Code Generation Download

Automatic Skeleton-Based Compilation through Integration with an Algorithm Classification Download

Automatic Software Synthesis from High-Level ForSyDe Models Targeting Massively Parallel Processors Download

Automatic source code adaptation for heterogeneous platforms Download

Automatic Synthesis of Heterogeneous CPU-GPU Embedded Applications from a UML Profile Download

Automatic Termination Analysis for GPU Kernels Download

Automatic Test Case Reduction for OpenCL Download

Automatic test case reduction of randomly generated OpenCL kernels Download

Automatic transformation and optimization of applications on GPUs and GPU clusters Download

Automatic Translation of CUDA to OpenCL and Comparison of Performance Optimizations on GPUs Download

Automatic tuning matrix multiplication performance on graphics hardware Download

Automatic Tuning of Local Memory Use on GPGPUs Download

Automatic Virtualization of Accelerators Download

Automatically Exploiting the Memory Hierarchy of GPUs through Just-in-Time Compilation Download Package

Automatically generating and tuning GPU code for sparse matrix-vector multiplication from a high-level representation Download

Automatically Generating Efficient Simulation Codes on GPUs from Partial Differential Equations Download

Automatically Harnessing Sparse Acceleration Download

Automatically Selecting Profitable Thread Block Sizes Using Machine Learning Download

Automatically translating a general purpose C++ image processing library for GPUs Download

Automatically Tuned Dense Linear Algebra for Multicore+GPU Download Package

Automatically Tuning Sparse Matrix-Vector Multiplication for GPU Architectures

Automating a Labour Performance Measurement and Risk Assessment: An Evaluation of Methods for a Computer Vision based System Download

Automating elimination of idle functions by run-time reconfiguration Download

Automating GPU computing in MATLAB

Automating Heterogeneous Parallelism in Numerical Differential Equations Download Package

Automating the Last-Mile for High Performance Dense Linear Algebra Download

AutOMP: An Automatic OpenMP Parallelization Generator for Variable-Oriented High-Performance Scientific Codes Download

AutoParBench: A Unified Test Framework for OpenMP-based Parallelizers Download Package

AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning Download

Autotuning CUDA Compiler Parameters for Heterogeneous Applications using the OpenTuner Framework Download Package

Autotuning CUDA: Applying NLP Techniques to LS-CAT Download

Autotuning for Automatic Parallelization on Heterogeneous Systems Download

Autotuning GEMMs for Fermi Download Package

Autotuning GPU Kernels via Static and Predictive Analysis Download

Autotuning of Pattern Runtimes for Accelerated Parallel Systems Download

Autotuning OpenACC Work Distribution via Direct Search Download Package

Autotuning OpenCL Workgroup Size for Stencil Patterns Download

Autotuning Programs with Algorithmic Choice Download Package

Autotuning Stencil-Based Computations on GPUs Download Package

Autotuning Stencils Codes with Algorithmic Skeletons Download Package

Autotuning Tensor Contraction Computations on GPUs Download

Autotuning Wavefront Abstractions for Heterogeneous Architectures Download

Autotuning Wavefront Patterns for Heterogeneous Architectures Download

Autotuning, Code Generation and Optimizing Compiler Technology for GPUs Download

Auxiliary Image Regularization for Deep CNNs with Noisy Labels Download

AvA: Accelerated Virtualization of Accelerators Download Package

AVEC: Accelerator Virtualization in Cloud-Edge Computing for Deep Learning Libraries Download

AVSS2011 demo session: GPU enabled Smart Video Node Download

AVX-512 extension to OpenQCD 1.6 Download Package

AXC: A new format to perform the SpMV oriented to Intel Xeon Phi architecture in OpenCL Download

Axel: a heterogeneous cluster with FPGAs and GPUs Download

AZP: Automatic Specialization for Zero Values in Gaming Applications Download

b-Bit Minwise Hashing in Practice: Large-Scale Batch and Online Learning and Using GPUs for Fast Preprocessing with Simple Hash Functions Download

B-CALM: An open-source GPU-based 3D-FDTD with multi-pole dispersion for plasmonics Download Package

B-Calm: an Open-Source Multi-Gpu-Based 3D-FDTD with Multi-Pole Dispersion for Plasmonics Download Package

Back Ground Subtraction Algorithm For Moving Object Detection In FPGA Download

Backpropagation Training for Fisher Vectors within Neural Networks Download

BaCO: A Fast and Portable Bayesian Compiler Optimization Framework Download Package

Bacon: A GPU Programming System With Just in Time Specialization Download Package

Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs Download Package

Balancing locality and concurrency: solving sparse triangular systems on GPUs Download

Balancing Tracking Granularity and Parallelism in Many-Task Systems: The Horizons Approach Download Package

Bamboo: Automatic Translation of MPI Source into a Latency-Tolerant Form Download Package

Bandicoot: C++ Library for GPU Linear Algebra and Scientific Computing Download Package

Bandwidth intensive 3-D FFT kernel for GPUs using CUDA

Bandwidth Reduction Through Multithreaded Compression of Seismic Images Download

Bandwidth Requirements of GPU Architectures Download

BANG: Billion-Scale Approximate Nearest Neighbor Search using a Single GPU Download

Barnes-hut treecode on GPU

Barra, a Modular Functional GPU Simulator for GPGPU Download Package

Barra: A Parallel Functional Simulator for GPGPU Download Package

BarraCUDA – a fast short read sequence aligner using graphics processing units Download Package

Barrier Invariants: A Shared State Abstraction for the Analysis of Data-Dependent GPU Kernels Download Package

Barycentric coordinates computation in homogeneous coordinates Download

BASEMENT v3: a modular freeware for river process modelling over multiple computational backends Download Package

Basker: A Threaded Sparse LU Factorization Utilizing Hierarchical Parallelism and Data Layouts Download

BAT: A Benchmark suite for AutoTuners Download Package

Batch Method for Efficient Resource Sharing in Real-time Multi-GPU Systems Download

Batch Records Insertion into Multidimensional Linear Dynamic Hashing Table on GPU Download

Batched Kronecker product for 2-D matrices and 3-D arrays on NVIDIA GPUs Download

 

Brief statistics for this page

Titles: 100

Download open PDFs: 95

Package packages: 28

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: