Papers on hgpu.org (.txt-file)

Character-level Transformer-based Neural Machine Translation Download

Characterizing and Detecting CUDA Program Bugs Download Package

Characterising Across-Stack Optimisations for Deep Convolutional Neural Networks Download Package

Characterising Bipartite Graph Matching Algorithms on GPUs Download

Characterization and Analysis of Dynamic Parallelism in Unstructured GPU Applications Download

Characterization and Exploitation of GPU Memory Systems Download

Characterization and Performance Analysis for 3D Benchmarks Download

Characterization and Transformation of Unstructured Control Flow in Bulk Synchronous GPU Applications Download

Characterization and Transformation of Unstructured Control Flow in GPU Applications Download

Characterization of FPGA-based High Performance Computers Download

Characterization of Lossy SIW Resonators Based on Multilayer Perceptron Neural Networks on Graphics Processing Unit Download

Characterization of OpenCL on a Scalable FPGA Architecture Download

Characterization of Speech Recognition Systems on GPU Architectures Download

Characterizing and Enhancing Global Memory Data Coalescing on GPUs Download

Characterizing and Evaluating a Key-value Store Application on Heterogeneous CPU-GPU Systems Download

Characterizing and Improving the Use of Demand-Fetched Caches in GPUs Download

Characterizing and Optimizing Irregular Applications on Graphics Processing Units Download

Characterizing and Predicting Scientific Workloads for Heterogeneous Computing Systems Download Package

Characterizing CUDA and OpenMP Synchronization Primitives Download Package

Characterizing Dataset Dependence for Sparse Matrix-Vector Multiplication on GPUs Download

Characterizing Deep Learning Training Workloads on Alibaba-PAI Download

Characterizing Optimizations to Memory Access Patterns using Architecture-Independent Program Features Download

Characterizing the Challenges and Evaluating the Efficacy of a CUDA-to-OpenCL Translator Download

Charged particles constrained to a curved surface Download Package

CHARM-SYCL: New Unified Programming Environment for Multiple Accelerator Types Download Package

Chat AI: A Seamless Slurm-Native Solution for HPC-Based Services Download Package

Chebyshev Filter Diagonalization on Modern Manycore Processors and GPGPUs Download Package

CheCL: Transparent Checkpointing and Process Migration of OpenCL Applications Download

CheCUDA: A Checkpoint/Restart Tool for CUDA Applications Download

Chest CT automatic analysis for lung nodules detection implemented on a GPU computing system Download

Chestnut: A GPU Programming Language for Non-Experts Download

CHO: A Benchmark Suite for OpenCL-based FPGA Accelerators Download Package

CHO: Towards a Benchmark Suite for OpenCL FPGA Accelerators Download Package

Cholla: A New Massively-Parallel Hydrodynamics Code For Astrophysical Simulation Download

CHPS: An Environment for Collaborative Execution on Heterogeneous Desktop Systems Download

Chrono: a parallel multi-physics library for rigid-body, flexible-body, and fluid dynamics Download Package

Chunkflow: Distributed Hybrid Cloud Processing of Large 3D Images by Convolutional Nets Download Package

CI/CD Efforts for Validation, Verification and Benchmarking OpenMP Implementations Download

Cinematic Particle Systems with OpenCL Download Package

Circular Hough Transform in OpenCL Download Package

CitiusSynapse: A Deep Learning Framework for Embedded Systems Download

CL-VIS: Visualization Platform for Understanding and Checking the OpenCL Programs Download

CL2QCD – Lattice QCD based on OpenCL Download Package

Clacc: Translating OpenACC to OpenMP in Clang Download

Classical Mechanical Hard-Core Particles Simulated in a Rigid Enclosure using Multi-GPU Systems Download

Classical Simulation of Quantum Adiabatic Algorithms using Mathematica on GPUs Download Package

Classification-based Financial Markets Prediction using Deep Neural Networks Download

Classification of Higgs Boson Tau-Tau decays using GPU accelerated Neural Networks Download

Classification Performance of Convolutional Neural Networks Download

Classify QCD phase transition with deep learning Download Package

ClawHMMER: A Streaming HMMer-Search Implementation Download

CLBlast: A Tuned OpenCL BLAS Library Download Package

ClearPath: highly parallel collision avoidance for multi-agent simulation Download

ClearView: An Interactive Context Preserving Hotspot Visualization Technique Download Package

CLgrep: A Parallel String Matching Tool Download Package

Climbing Mont Blanc – A Training Site for Energy Efficient Programming on Heterogeneous Multicore Processors Download

Clinically applicable Monte Carlo-based biological dose optimization for the treatment of head and neck cancers with spot-scanning proton therapy Download

Clipmapping on the GPU Download

clMAGMA: High Performance Dense Linear Algebra with OpenCL Download Package

clMF: A fine-grained and portable alternating least squares algorithm for parallel matrix factorization Download Package

Clock Math – A System for Solving SLEs Exactly Download

CLOP: A Multi-stage Compiler to Seamlessly Embed Heterogeneous Code Download Package

clOpenCL – Supporting Distributed Heterogeneous Computing in HPC Clusters Download

CLort: High Throughput and Low Energy Network Intrusion Detection on IoT Devices with Embedded GPUs Download Package

Closing the Ninja Performance Gap through Traditional Programming and Compiler Technology Download

Cloth Simulation on the GPU Download

Cloth Simulation Using AABB Hierarchies and GPU Parallelism

CloudCL: Single-Paradigm Distributed Heterogeneous Computing for Cloud Infrastructures Download Package

Cloudlet-screen computing: A multi-core-based, cloud-computing-oriented, traditional-computing-compatible parallel computing Paradigm for the masses

clpeak – peak performance of your opencl device Package

clRNG: A Random Number API with Multiple Streams for OpenCL Download Package

clSPARSE: A Vendor-Optimized Open-Source Sparse BLAS Library Download Package

clSpMV: A Cross-Platform OpenCL SpMV Framework on GPUs Download Package

CLTestCheck: Measuring Test Effectiveness for GPU Kernels Download Package

cltorch: a Hardware-Agnostic Backend for the Torch Deep Neural Network Library, Based on OpenCL Download Package

CLTune: A Generic Auto-Tuner for OpenCL Kernels Download Package

ClusCo: clustering and comparison of protein models Download Package

Cluster and Fast-Update Simulations of Regular and Rewired Lattice Ising Models Using CUDA and Graphical Processing Units Download

Cluster versus GPU implementation of an Orthogonal Target Detection Algorithm for Remotely Sensed Hyperspectral Images Download

Cluster-Level Tuning of a Shallow Water Equation Solver on the Intel MIC Architecture Download

Cluster-SkePU: A Multi-Backend Skeleton Programming Library for GPU Clusters Download

Clustering Based Search Algorithm For Motion Estimation Download

Clustering billions of data points using GPUs Download

Clustering coefficient queries on massive dynamic social networks

Clustering on GPU – A Brief Survey Download

Clustering Throughput Optimization on the GPU Download

ClusterWatch: Flexible, Lightweight Monitoring for High-end GPGPU Clusters Download

CMA-ES for Hyperparameter Optimization of Deep Neural Networks Download Package

CMCpy: Genetic Code-Message Coevolution Models in Python Download Package

CMLCompiler: A Unified Compiler for Classical Machine Learning Download

CnC-CUDA: declarative programming for GPUs Download

CNN2Gate: An Implementation of Convolutional Neural Networks Inference on FPGAs with Automated Design Space Exploration Download

CNNLab: a Novel Parallel Framework for Neural Networks using GPU and FPGA-a Practical Study with Trade-off Analysis Download

Co-design of a particle-in-cell plasma simulation code for Intel Xeon Phi: a first look at Knights Landing Download

Co-processing SPMD Computation on GPUs and CPUs on Shared Memory System Download

Co-processor acceleration of an unmodified parallel solid mechanics code with FEASTGPU Download

Co-tuning of Software Specializers and Hardware Accelerators within a CNN Application Download

Coalition Structure Generation with the Graphic Processor Unit Download

Coalition Structure Generation with the Graphics Processing Unit Download Package

Coarse grain computation-communication overlap for efficient application-level checkpointing for GPUs

Brief statistics for this page

Titles: 100

Download open PDFs: 95

Packages: 36

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
