Unified schemes for directive-based GPU offloading
Information Technology Center, The University of Tokyo, 6-2-3 Kashiwanoha, Kashiwa, Chiba 277-0882, Japan
arXiv:2411.18889 [cs.DC], (28 Nov 2024)
@article{Miki_2024,
title={Unified schemes for directive-based GPU offloading},
ISSN={2169-3536},
url={http://dx.doi.org/10.1109/ACCESS.2024.3509380},
DOI={10.1109/access.2024.3509380},
journal={IEEE Access},
publisher={Institute of Electrical and Electronics Engineers (IEEE)},
author={Miki, Yohei and Hanawa, Toshihiro},
year={2024},
pages={1–1}
}
GPU is the dominant accelerator device due to its high performance and energy efficiency. Directive-based GPU offloading using OpenACC or OpenMP target is a convenient way to port existing codes originally developed for multicore CPUs. Although OpenACC and OpenMP target provide similar features, both methods have pros and cons. OpenACC has better functions and an abundance of documents, but it is virtually for NVIDIA GPUs. OpenMP target supports NVIDIA/AMD/Intel GPUs but has fewer functions than OpenACC. Here, we have developed a header-only library, Solomon (Simple Off-LOading Macros Orchestrating multiple Notations), to unify the interface for GPU offloading with the support of both OpenACC and OpenMP target. Solomon provides three types of notations to reduce users’ implementation and learning costs: intuitive notation for beginners and OpenACC/OpenMP-like notations for experienced developers. This manuscript denotes Solomon’s implementation and usage and demonstrates the GPU-offloading in N-body simulation and the three-dimensional diffusion equation. The library and sample codes are provided as open-source software and publicly and freely available.
December 8, 2024 by hgpu