Papers on hgpu.org (.txt-file)
Exascale Deep Learning for Scientific Inverse Problems
Executing Dynamic Data Rate Actor Networks on OpenCL Platforms
Executing Process Networks on Heterogeneous Platforms using OpenCL
Execution of Compound Multi-Kernel OpenCL Computations in Multi-CPU/Multi-GPU Environments
Exercising high-level parallel programming on streams: a systems biology use case
EXOCHI: architecture and programming environment for a heterogeneous multi-core multithreaded system
Expanding the boundaries of GPU computing
Expanding the VPE-qGM Environment Towards a Parallel Quantum Simulation of Quantum Processes Using GPUs
Expansion Techniques for Collisionless Stellar Dynamical Simulations
Experience Applying Fortran GPU Compilers to Numerical Weather Prediction
Experience Migrating OpenCL to SYCL: A Case Study on Searches for Potential Off-Target Sites of Cas9 RNA-Guided Endonucleases on AMD GPUs
Experience of Migrating a Parallel Graph Coloring Program from CUDA to SYCL
Experience of parallelizing cryo-EM 3D reconstruction on a CPU-GPU heterogeneous system
Experience Report: Writing A Portable GPU Runtime with OpenMP 5.1
Experience with Intel’s Many Integrated Core architecture in ATLAS software
Experiences Building an MLIR-based SYCL Compiler
Experiences Developing the OpenUH Compiler and Runtime Infrastructure
Experiences in Building a Composable and Functional API for Runtime SPIR-V Code Generation
Experiences in Data-Parallel Simulation and Analysis of Complex Systems with Irregular Graph Structures
Experiences in Speeding Up Computer Vision Applications on Mobile Computing Platforms
Experiences in Teaching a Specialty Multicore Computing Course
Experiences Migrating CUDA to SYCL: A Molecular Docking Case Study
Experiences Porting a Molecular Dynamics Code to GPUs on a Cray XK7
Experiences with Achieving Portability across Heterogeneous Architectures
Experiences with Cell-BE and GPU for Tomography
Experiences with High-Level Programming Directives for Porting Applications to GPUs
Experiences with hybrid clusters
Experiences with implementing Kokkos’ SYCL backend
Experiences with Mapping Non-linear Memory Access Patterns into GPUs
Experimental Evaluation of Multiprecision Strategies for GMRES on GPUs
Experimental Evaluation of Thread Distribution Effects on Multiple Output Errors in GPUs
Experimental Fault-Tolerant Synchronization for Reliable Computation on Graphics Processors
Experimentation Procedure for Offloaded Mini-Apps Executed on Cluster Architectures with Xeon Phi Accelerators
Experiments on Parallel Training of Deep Neural Network using Model Averaging
Experiments with Massively Parallel Matrix Multiplication
Experiments with Single Core, Multi-core, and GPU Based Computation of Cellular Automata
Explainable Deep Behavioral Sequence Clustering for Transaction Fraud Detection
Explicit Cache Management for Volume Ray-Casting on Parallel Architectures
Explicit caching HYB: a new high-performance SpMV framework on GPGPU
Explicit Control of Vector Field Based Shape Deformations
Explicit Fourth-Order Runge-Kutta Method on Intel Xeon Phi Coprocessor
Explicit Integration with GPU Acceleration for Large Kinetic Networks
Explicit platform descriptions for heterogeneous many-core architectures
Explicit Shallow Water Simulations on GPUs: Guidelines and Best Practices
Exploded Views for Volume Data
Exploitation of GPUs for the Parallelisation of Probably Parallel Legacy Code
Exploiting BSP Abstractions for Compiler Based Optimizations of GPU Applications on multi-GPU Systems
Exploiting co-execution with oneAPI: heterogeneity from a modern perspective
Exploiting Coarse-grained Parallelism in B+ Tree Searches on an APU
Exploiting Computational Resources in Distributed Heterogeneous Platforms
Exploiting Computing Power on Graphics Processing Unit
Exploiting Concurrency Patterns with Heterogeneous Task and Data Parallelism
Exploiting Concurrent GPU Operations for Efficient Work Stealing on Multi-GPUs
Exploiting concurrent kernel execution on graphic processing units
Exploiting contextual information for image re-ranking and rank aggregation
Exploiting Data Parallelism in GPUs
Exploiting Data Parallelism in the yConvex Hypergraph Algorithm for Image Representation using GPGPUs
Exploiting dynamic sparse matrices for performance portable linear algebra operations
Exploiting frame-to-frame coherence for accelerating high-quality volume raycasting on graphics hardware
Exploiting GPU On-chip Shared Memory for Accelerating Schedulability Analysis
Exploiting GPU Parallelism to Optimize Real-World Problems
Exploiting GPUs to investigate an inversion method that retrieves cardiac conductivities from potential measurements
Exploiting Graphic Processing Units Parallelism to Improve Intelligent Data Acquisition System Performance in JET’s Correlation Reflectometer
Exploiting graphical processing units for data-parallel scientific applications
Exploiting graphics processing units for computational biology and bioinformatics
Exploiting Heterogeneity for Energy Efficiency in Chip Multiprocessors
Exploiting Heterogeneous Computing Platforms By Cataloging Best Solutions For Resource Intensive Seismic Applications
Exploiting Heterogeneous Systems: Keccak on OpenCL
Exploiting Hyper-Loop Parallelism in Vectorization to Improve Memory Performance on CUDA GPGPU
Exploiting Limited Access Distance of ODE Systems for Parallelism and Locality in Explicit Methods
Exploiting Memory Access Patterns to Improve Memory Performance in Data-Parallel Architectures
Exploiting More Parallelism from Applications Having Generalized Reductions on GPU Architectures
Exploiting multi-level parallelism in streaming applications for heterogeneous platforms with GPUs
Exploiting Multi-level Parallelism on a Many-core System for the Application of Hyperheuristics to a Molecular Docking Problem
Exploiting Multiple Levels of Parallelism and Online Refinement of Unstructured Meshes in Atmospheric Model Application
Exploiting OpenMP & OpenACC to Accelerate a Molecular Docking Mini-App in Heterogeneous HPC Nodes
Exploiting parallel features of modern computer architectures in bioinformatics
Exploiting parallel features of modern computer architectures in bioinformatics: applications to genetics, structure comparison and large graph analysis
Exploiting Parallel Processing Power of GPU for High Speed Frequent Pattern Mining
Exploiting Parallelism in GPUs
Exploiting Parallelism in Iterative Irregular Maxflow Computations on GPU Accelerators
Exploiting Segmentation for Robust 3D Object Matching
Exploiting SIMD extensions for linear image processing with OpenCL
Exploiting Space and Time Coherence in Grid-based Sorting
Exploiting SPMD Horizontal Locality
Exploiting SPMD Horizontal Locality to Improve Memory Efficiency
Exploiting Task Parallelism with OpenCL: A Case Study
Exploiting Task-Parallelism on GPU Clusters via OmpSs and rCUDA Virtualization
Exploiting the Parallelism of Heterogeneous Systems using Dataflow Graphs on Top of OpenCL
Exploiting the Power of GPUs for Asymmetric Cryptography
Exploiting two-level parallelism by aggregating computing resources in task-based applications over accelerator-based machines
Exploiting Unexploited Computing Resources for Computational Logics
Exploiting Uniform Vector Instructions for GPGPU Performance, Energy Efficiency, and Opportunistic Reliability Enhancement
Exploration of cyber-physical systems for GPGPU computer vision-based detection of biological viruses
Exploration of Low Numeric Precision Deep Learning Inference Using Intel FPGAs
Exploration of Multifrontal Method with GPU in Power Flow Computation
Exploration of Optimization Options for Increasing Performance of a GPU Implementation of a Three-Dimensional Bilateral Filter
Exploration of Parallelization Frameworks for Computational Finance
Explorations of the Viability of ARM and Xeon Phi for Physics Processing
Titles: 100
open PDFs: 91
packages: 14