Practical Implementation of Lattice QCD Simulation on Intel Xeon Phi Knights Landing
Department of Physics, Hiroshima University, Higashi-hiroshima 739-8526, Japan
arXiv:1712.01505 [hep-lat], (5 Dec 2017)
@article{kanamori2017practical,
title={Practical Implementation of Lattice QCD Simulation on Intel Xeon Phi Knights Landing},
author={Kanamori, Issaku and Matsufuru, Hideo},
year={2017},
month={dec},
archivePrefix={"arXiv"},
primaryClass={hep-lat}
}
We investigate implementation of lattice Quantum Chromodynamics (QCD) code on the Intel Xeon Phi Knights Landing (KNL). The most time consuming part of the numerical simulations of lattice QCD is a solver of linear equation for a large sparse matrix that represents the strong interaction among quarks. To establish widely applicable prescriptions, we examine rather general methods for the SIMD architecture of KNL, such as using intrinsics and manual prefetching, to the matrix multiplication and iterative solver algorithms. Based on the performance measured on the Oakforest-PACS system, we discuss the performance tuning on KNL as well as the code design for facilitating such tuning on SIMD architecture and massively parallel machines.
December 7, 2017 by hgpu