Generating SU(Nc) pure gauge lattice QCD configurations on GPUs with CUDA and OpenMP

Nuno Cardoso, Pedro Bicudo
CFTP, Departamento de Fisica, Instituto Superior Tecnico, Universidade Tecnica de Lisboa, Av. Rovisco Pais, 1049-001 Lisbon, Portugal
arXiv:1112.4533v1 [hep-lat] (20 Dec 2011)


   author={Cardoso}, N. and {Bicudo}, P.},

   title={"{Generating SU(Nc) pure gauge lattice QCD configurations on GPUs with CUDA and OpenMP}"},

   journal={ArXiv e-prints},




   keywords={High Energy Physics – Lattice, Physics – Computational Physics},




   adsnote={Provided by the SAO/NASA Astrophysics Data System}


The starting point of any lattice QCD computation is the generation of a Markov chain of gauge field configurations. Due to the large number of lattice links and due to the matrix multiplications, generating SU(Nc) lattice QCD configurations is a highly demanding computational task, requiring advanced computer parallel architectures such as clusters of several Central Processing Units (CPUs) or Graphics Processing Units (GPUs). In this paper we present and explore the performance of CUDA codes for NVIDIA GPUs to generate SU(Nc) lattice QCD pure gauge configurations. Our implementation in one GPU uses CUDA and in multiple GPUs uses OpenMP and CUDA. We present optimized CUDA codes SU(2), SU(3) and SU(4). We also show a generic SU(Nc) code for Nc$,geq 4$ and compare it with the optimized version of SU(4). Our codes are publicly available for free use by the lattice QCD community.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2017 hgpu.org

All rights belong to the respective authors

Contact us: