Towards Performance Portable Programming for Distributed Heterogeneous Systems

hgpu.org » Applications » Computer science » Towards Performance Portable Programming for Distributed Heterogeneous Systems

Towards Performance Portable Programming for Distributed Heterogeneous Systems

Polykarpos Thomadakis, Nikos Chrisochoides

Department of Computer Science, Old Dominion University, Norfolk, Virginia

arXiv:2210.01238 [cs.DC], (3 Oct 2022)

DOI:10.48550/arXiv.2210.01238

BibTeX

Download (PDF)

View

Source

Source codes

Package:

Heterogeneous PREMA

970

views

Hardware heterogeneity is here to stay for high-performance computing. Large-scale systems are currently equipped with multiple GPU accelerators per compute node and are expected to incorporate more specialized hardware in the future. This shift in the computing ecosystem offers many opportunities for performance improvement; however, it also increases the complexity of programming for such architectures. This work introduces a runtime framework that enables effortless programming for heterogeneous systems while efficiently utilizing hardware resources. The framework is integrated within a distributed and scalable runtime system to facilitate performance portability across heterogeneous nodes. Along with the design, this paper describes the implementation and optimizations performed, achieving up to 300% improvement in a shared memory benchmark and up to 10 times in distributed device communication. Preliminary results indicate that our software incurs low overhead and achieves 40% improvement in a distributed Jacobi proxy application while hiding the idiosyncrasies of the hardware.

Tags: Computer science, CUDA, Heterogeneous systems, nVidia, OpenCL, Package, Performance, performance portability, Tesla V100

October 9, 2022 by hgpu

No votes yet.

Please wait...

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Engineering Supercomputing Platforms for Biomolecular Applications

high performance computing on graphics processing units: hgpu.org