
Papers on hgpu.org (.txt-file)

Auto-Tuning CUDA Parameters for Sparse Matrix-Vector Multiplication on GPUs Download

Auto-Tuning Dedispersion for Many-Core Accelerators Download

Auto-tuning Dense Matrix Multiplication for GPGPU with Cache

Auto-tuning Dense Vector and Matrix-Vector Operations for Fermi GPUs Download

Auto-tuning Hybrid CPU-GPU Execution of Algorithmic Skeletons in SkePU Download

Auto-tuning interactive ray tracing using an analytical GPU architecture model Download

Auto-tuning of fast fourier transform on graphics processors Download

Auto-Tuning of Level 1 and Level 2 BLAS for GPUs Download

Auto-tuning on the macro scale: high level algorithmic auto-tuning for scientific applications Download

Auto-tuning Shallow water simulations on GPUs Download

Auto-tuning SkePU: a multi-backend skeleton programming framework for multi-GPU systems Download Package

Auto-tuning Streamed Applications on Intel Xeon Phi Download Package

Auto-Tunning of Data Communication on Heterogeneous Systems Download

Auto-Vectorizing a Large-scale Production Unstructured-mesh CFD Application Download

AutoDDL: Automatic Distributed Deep Learning with Asymptotically Optimal Communication Download Package

AutoFreeze: Automatically Freezing Model Blocks to Accelerate Fine-tuning Download Package

AutoMat – Automatic Differentiation for Generalized Standard Materials on GPUs Download

Automated and interactive approaches for optimal surface finding based segmentation of medical image data Download

Automated and parallel code generation for finite-differencing stencils with arbitrary data types Download

Automated Architecture Design for Deep Neural Networks Download

Automated architecture-aware mapping of streaming applications onto GPUs Download

Automated Buffer Sizing of Dataflow Applications in a High-Level Synthesis Workflow Download Package

Automated C/C++ Program Repair for High-Level Synthesis via Large Language Models Download

Automated Deep Learning Optimization via DSL-Based Source Code Transformation Download Package

Automated development of applications for graphical processing units using rewriting rules

Automated Dynamic Analysis of CUDA Programs Download

Automated Enhanced Parallelization of Sequential C to Parallel OpenMP Download

Automated Generation of OpenCL Programs Based on Algebra-Algorithmic Approach Download

Automated GPU Kernel Transformations in Large-Scale Production Stencil Applications Download

Automated image alignment for 2D gel electrophoresis in a high-throughput proteomics pipeline Download Package

Automated Long-Term Monitoring of Parallel Microfluidic Operations Applying a Machine Vision-Assisted Positioning Method Download

Automated Partitioning of Data-Parallel Kernels using Polyhedral Compilation Download Package

Automated pose estimation in 3D point clouds applying annealing particle filters and inverse kinematics on a GPU Download

Automated Runtime Analysis and Adaptation for Scalable Heterogeneous Computing Download Package

Automated Software Testing of Memory Performance in Embedded GPUs Download

Automated Techniques for Enabling Efficient MPI Application Migration Download

Automated test generation for OpenCL kernels using fuzzing and constraint solving Download Package

Automated Testing of Graphics Shader Compilers Download

Automated Tool to Generate Parallel CUDA code from a Serial C Code Download

Automatic abstraction and fault tolerance in cortical microarchitectures Download

Automatic acceleration of Numpy applications on GPUs and multicore CPUs Download

Automatic and Explicit Parallelization Approaches for Mathematical Simulation Models Download

Automatic and portable mapping of data parallel programs to OpenCL for GPU-based heterogeneous systems Download

Automatic bi-layer video segmentation based on sensor fusion Download

Automatic BLAS Offloading on Unified Memory Architecture: A Study on NVIDIA Grace-Hopper Download

Automatic C-to-CUDA Code Generation for Affine Programs Download

Automatic classification of object code using machine learning Download

Automatic Code Generation and Adaptive Grid Scheduling for GPU Cluster Computing Download

Automatic code generation and tuning for stencil kernels on modern shared memory architectures Download Package

Automatic code generation for solvers of cardiac cellular membrane dynamics in GPUs

Automatic Code Generation for Stencil Computations on GPU Architectures Download

Automatic code generation methods applied to numerical linear algebra in high performance computing Download

Automatic Code Rewriting for Performance Portability Download

Automatic Command Queue Scheduling for Task-Parallel Workloads in OpenCL Download

Automatic Compilation for Heterogeneous Architectures with Single Assignment C Download

Automatic compilation of MATLAB programs for synergistic execution on heterogeneous processors Download

Automatic Compiler Based FPGA Accelerator for CNN Training Download

Automatic contention detection and amelioration for data-intensive operations Download

Automatic CPU-GPU communication management and optimization Download

Automatic CUDA Code Synthesis Framework for Multicore CPU and GPU architectures Download

Automatic Data Layout Generation and Kernel Mapping for CPU+GPU Architectures Download

Automatic Data Layout Optimizations for GPUs Download

Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories Download

Automatic Detection and Denoising of Signals in Large Geophysical Datasets Download

Automatic Discovery of Algorithms for Multi-Agent Systems Download

Automatic Dynamic Task Distribution between CPU and GPU for Real-Time Systems

Automatic efficient data layout for multithreaded stencil codes on CPUs and GPUs Download

Automatic fitting of spiking neuron models to electrophysiological recordings Download Package

Automatic Fusions of CUDA-GPU Kernels for Parallel Map Download

Automatic Generation Of Application-Specific Accelerators for FPGAs from Python Loop Nests Download Package

Automatic generation of CUDA code performing tensor manipulations using C++ expression templates Download

Automatic Generation of FFT Libraries for GPU Platforms Download

Automatic generation of heterogeneous spectrometers for radio astronomy Download

Automatic Generation of Multicore Chemical Kernels

Automatic Generation of OpenCL Code for ARM Architectures Download

Automatic Generation of OpenCL Code through Polyhedral Compilation with LLM Download

Automatic generation of software pipelines for heterogeneous parallel systems Download

Automatic generation of warp-level primitives and atomic instructions for fast and portable parallel reduction on GPUs Download

Automatic GPU optimization through higher-order functions in functional languages Download

Automatic Hepatic Vessel Segmentation Using Graphics Hardware Download

Automatic Implementation of Evolutionary Algorithms on GPUs using ESDL Download Package

Automatic Kernel Generation for Volta Tensor Cores Download

Automatic library generation for BLAS3 on GPUs Download

Automatic Loop Partitioning for Heterogeneous Systems Download

Automatic Mapping for OpenCL-Programs on CPU/GPU Heterogeneous Platforms Download

Automatic Mapping of Stream Programs on Multicore Architectures Download

Automatic Multi-Camera Setup Optimization for Optical Tracking Download

Automatic Multi-GPU Code Generation applied to Simulation of Electrical Machines Download

Automatic NUMA Characterization using Cbench Download Package

Automatic Online Tuning (AutoTune): Fully Extended Analysis Download

Automatic OpenCL code generation for multi-device heterogeneous architectures Download

Automatic OpenCL Device Characterization: Guiding Optimized Kernel Design Download

Automatic OpenCL Task Adaptation for Heterogeneous Architectures Download

Automatic Optimization of In-Flight Memory Transactions for GPU Accelerators based on a Domain-Specific Language for Medical Imaging Download Package

Automatic Optimization of OpenCL-Based Stencil Codes for FPGAs and Its Evaluation Download

Automatic Optimization of Thread Mapping for a GPGPU Programming Framework Download

Automatic Parallelization for GPUs Download

Automatic parallelization for graphics processing units Download

Automatic Parallelization for Heterogeneous Embedded Systems Download

Automatic Parallelization of a Gap Model using Java and OpenCL Download

 

Brief statistics for this page

Titles: 100

Download open PDFs: 95

Packages: 16

* * *


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
