high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » CUDA » SU(2) Lattice QCD Simulations on Fermi GPUs

SU(2) Lattice QCD Simulations on Fermi GPUs

Nuno Cardoso, Pedro Bicudo

CFTP, Departamento de Fisica, Instituto Superior Tecnico, Av. Rovisco Pais, 1049-001 Lisboa, Portugal

arXiv:1010.4834 [hep-lat] (23 Oct 2010)

@article{cardoso20102,

title={SU (2) Lattice QCD Simulations on Fermi GPUs},

author={Cardoso, N. and Bicudo, P.},

journal={Arxiv preprint arXiv:1010.4834},

year={2010}

}

Download (PDF)

View

Source

Source codes

Package:

PTQCD – Portuguese Lattice QCD Collaboration

1662

views

In this work we explore the performance of CUDA in lattice SU(2) simulations. CUDA, NVIDIA Compute Unified Device Architecture, is a hardware and software architecture developed by NVIDIA for computing on the GPU. We present an analysis and performance comparison between the GPU and CPU in single and double precision. Analysis with multiple GPUs and two different architectures (G200 and Fermi architectures) are also presented. In order to obtain a high performance, the code must be optimized for the GPU architecture, i.e., an implementation that exploits the memory hierarchy of the CUDA programming model. We produce codes for the Monte Carlo generation of SU(2) lattice QCD configurations, for the mean plaquette, for the Polyakov Loop at finite T and for the Wilson loop. We also present results for the potential using many configurations ($50 000$) without smearing and almost $2 000$ configurations with APE smearing. With two Fermi GPUs we have achieved an excellent performance of $200 times$ the speed over one CPU. We also find that, using the Fermi architecture, double precision computations for the static quark-antiquark potential are not much slower (less than $2 times$ slower) than single precision computations.

Tags: Computational Physics, CUDA, High Energy Physics – Lattice, Monte Carlo simulation, nVidia, nVidia GeForce GTX 295, nVidia GeForce GTX 480, Package, Physics

November 9, 2010 by hgpu

No votes yet.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

high performance computing on graphics processing units: hgpu.org

SU(2) Lattice QCD Simulations on Fermi GPUs

Package:

Recent source codes

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

GPU plugin for PySCF

QArray

Celerity: High-level C++ for Accelerator Clusters

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Most viewed papers (last 30 days)

SU(2) Lattice QCD Simulations on Fermi GPUs

Package:

Share this:

Recent source codes

Most viewed papers (last 30 days)