General purpose lattice QCD code set Bridge++ 2.0 for high performance computing
Yukawa Institute for Theoretical Physics, Kyoto University, Japan
arXiv:2111.04457 [hep-lat], (8 Nov 2021)
@misc{akahoshi2021general,
title={General purpose lattice QCD code set Bridge++ 2.0 for high performance computing},
author={Yutaro Akahoshi and Sinya Aoki and Tatsumi Aoyama and Issaku Kanamori and Kazuyuki Kanaya and Hideo Matsufuru and Yusuke Namekawa and Hidekatsu Nemura and Yusuke Taniguchi},
year={2021},
eprint={2111.04457},
archivePrefix={arXiv},
primaryClass={hep-lat}
}
Bridge++ is a general-purpose code set for a numerical simulation of lattice QCD aiming at a readable, extensible, and portable code while keeping practically high performance. The previous version of Bridge++ is implemented in double precision with a fixed data layout. To exploit the high arithmetic capability of new processor architecture, we extend the Bridge++ code so that optimized code is available as a new branch, i.e., an alternative to the original code. This paper explains our strategy of implementation and displays application examples to the following architectures and systems: Intel AVX-512 on Xeon Phi Knights Landing, Arm A64FX-SVE on Fujitsu A64FX (Fugaku), NEC SX-Aurora TSUBASA, and GPU cluster with NVIDIA V100.
November 14, 2021 by hgpu