Distributed OpenMP Offloading of OpenMC on Intel GPU MAX Accelerators
Department of Computer Science, Ben-Gurion University of the Negev, Israel
arXiv:2403.02735 [cs.DC], (5 Mar 2024)
@misc{fridman2024distributed,
title={Distributed OpenMP Offloading of OpenMC on Intel GPU MAX Accelerators},
author={Yehonatan Fridman and Guy Tamir and Uri Steinitz and Gal Oren},
year={2024},
eprint={2403.02735},
archivePrefix={arXiv},
primaryClass={cs.DC}
}
Monte Carlo (MC) simulations play a pivotal role in diverse scientific and engineering domains, with applications ranging from nuclear physics to materials science. Harnessing the computational power of high-performance computing (HPC) systems, especially Graphics Processing Units (GPUs), has become essential for accelerating MC simulations. This paper focuses on the adaptation and optimization of the OpenMC neutron and photon transport Monte Carlo code for Intel GPUs, specifically the Intel Data Center Max 1100 GPU (codename Ponte Vecchio, PVC), through distributed OpenMP offloading. Building upon prior work by Tramm J.R., et al. (2022), which laid the groundwork for GPU adaptation, our study meticulously extends the OpenMC code’s capabilities to Intel GPUs. We present a comprehensive benchmarking and scaling analysis, comparing performance on Intel MAX GPUs to state-of-the-art CPU execution (Intel Xeon Platinum 8480+ Processor, codename 4th generation Sapphire Rapids). The results demonstrate a remarkable acceleration factor compared to CPU execution, showcasing the GPU-adapted code’s superiority over its CPU counterpart as computational load increases.
March 10, 2024 by hgpu