high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » CUDA » Exact diagonalization of quantum lattice models on coprocessors

Exact diagonalization of quantum lattice models on coprocessors

Topi Siro, Ari Harju

Aalto University School of Science, P.O. Box 14100, 00076 Aalto, Finland

arXiv:1511.00863 [cond-mat.str-el], (3 Nov 2015)

@article{siro2015exact,

title={Exact diagonalization of quantum lattice models on coprocessors},

author={Siro, Topi and Harju, Ari},

year={2015},

month={nov},

archivePrefix={"arXiv"},

primaryClass={cond-mat.str-el}

}

Download (PDF)

View

Source

2166

views

We implement the Lanczos algorithm on an Intel Xeon Phi coprocessor and compare its performance to a multi-core Intel Xeon CPU and an NVIDIA graphics processor. The Xeon and the Xeon Phi are parallelized with OpenMP and the graphics processor is programmed with CUDA. The performance is evaluated by measuring the execution time of a single step in the Lanczos algorithm. We study two quantum lattice models with different particle numbers, and conclude that for small systems, the multi-core CPU is the fastest platform, while for large systems, the graphics processor is the clear winner, reaching speedups of up to 7.6 compared to the CPU. The Xeon Phi outperforms the CPU with sufficiently large particle number, reaching a speedup of 2.5.

Tags: Condensed matter, CUDA, Intel Xeon Phi, Mathematical Software, nVidia, OpenMP, Physics, Tesla K40

November 4, 2015 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

* * *

high performance computing on graphics processing units: hgpu.org

Exact diagonalization of quantum lattice models on coprocessors

Your response

Recent source codes

UniCoder: Unified Visual-to-Code Generation via Symbolic Rewards and Reference-Guided Code Optimization

CuFuzz: An API-Knowledge-Graph Coverage-Driven Fuzzing Framework for CUDA Libraries

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

Probe-and-Refine Tuning of Repository Guidance for AI Coding Agents

CUDAnalyst (CUDA + Analyst)

CodegenBench

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CUDA Kernel Fusion Benchmarks

IntelliKit: Agent-first tooling for AMD hardware

DITRON: Distributed Compiler based on Triton for Parallel Systems

Most viewed papers (last 30 days)

Exact diagonalization of quantum lattice models on coprocessors

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)