high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » CUDA » Blasting through lattice calculations using CUDA

Blasting through lattice calculations using CUDA

Kipton Barros, Ronald Babich, Richard Brower, Michael A. Clark, Claudio Rebbi

Department of Physics, Boston University, Boston, MA 02215

XXVI International Symposium on Lattice Field Theory (Lattice 2008), Williamsburg, Virginia, July 14-19, 2008, arXiv:0810.5365 [hep-lat] (29 Oct 2008)

@article{2008arXiv0810.5365B,

author={Barros}, K. and {Babich}, R. and {Brower}, R. and {Clark}, M.~A. and {Rebbi}, C.},

title={“{Blasting through lattice calculations using CUDA}”},

journal={ArXiv e-prints},

archivePrefix={“arXiv”},

eprint={0810.5365},

primaryClass={“hep-lat”},

keywords={High Energy Physics – Lattice},

year={2008},

month={oct},

adsurl={http://adsabs.harvard.edu/abs/2008arXiv0810.5365B},

adsnote={Provided by the SAO/NASA Astrophysics Data System}

}

Download (PDF)

View

Source

2205

views

Modern graphics hardware is designed for highly parallel numerical tasks and provides significant cost and performance benefits. Graphics hardware vendors are now making available development tools to support general purpose high performance computing. Nvidia’s CUDA platform, in particular, offers direct access to graphics hardware through a programming language similar to C. Using the CUDA platform we have implemented a Wilson-Dirac operator which runs at an effective 68 Gflops on the Tesla C870. The recently released GeForce GTX 280 runs this same code at 92 Gflops, and we expect further improvement pending code optimization.

Tags: CUDA, High Energy Physics – Lattice, Monte Carlo simulation, nVidia, nVidia GeForce GTX 280, Physics, QCD, Tesla C870

January 18, 2011 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Blasting through lattice calculations using CUDA

Your response

Recent source codes

UniCoder: Unified Visual-to-Code Generation via Symbolic Rewards and Reference-Guided Code Optimization

CuFuzz: An API-Knowledge-Graph Coverage-Driven Fuzzing Framework for CUDA Libraries

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

Probe-and-Refine Tuning of Repository Guidance for AI Coding Agents

CUDAnalyst (CUDA + Analyst)

CodegenBench

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CUDA Kernel Fusion Benchmarks

IntelliKit: Agent-first tooling for AMD hardware

DITRON: Distributed Compiler based on Triton for Parallel Systems

Most viewed papers (last 30 days)

Blasting through lattice calculations using CUDA

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)