high performance computing on graphics processing units: hgpu.org

Evaluating kernels on Xeon Phi to accelerate Gysela application

G. Latu, M. Haefele, J. Bigot, V. Grandgirard, T. Cartier-Michaud, F. Rozar

CEA Cadarache, F-13108 Saint-Paul-les-Durance Cedex

arXiv:1503.04645 [physics.comp-ph], (16 Mar 2015)

@article{latu2015evaluating,
  title={Evaluating kernels on Xeon Phi to accelerate Gysela application},
  author={Latu, G. and Haefele, M. and Bigot, J. and Grandgirard, V. and Cartier-Michaud, T. and Rozar, F.},
  year={2015},
  month={mar},
  archivePrefix={arXiv},
  primaryClass={physics.comp-ph}
}

This work describes the challenges of porting parts of the Gysela code to the Intel Xeon Phi coprocessor, as well as optimization, vectorization and tuning techniques that can be applied to other applications. We evaluate the performance of some generic micro-benchmarks on the Phi versus Intel Sandy Bridge. Several interpolation kernels useful for the Gysela application are analyzed and their performance is shown. Some memory-bound and compute-bound kernels are accelerated by a factor of 2 on the Phi device compared to the Sandy Bridge architecture. Nevertheless, it is hard, if not impossible, to reach a large fraction of the peak performance on the Phi device, especially for real-life applications such as Gysela. A collateral benefit of this optimization and tuning work is that the execution time of Gysela (using 4D advections) has decreased on a standard architecture such as Intel Sandy Bridge.
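
To give a concrete picture of the kind of interpolation kernel the abstract refers to, here is a minimal sketch of a 1D advection step built on cubic (4-point) Lagrange interpolation. The abstract does not say which interpolation scheme, grid, or boundary conditions the paper uses, so everything here is an assumption made for illustration: the uniform periodic grid with unit spacing and the function names `lagrange4_weights`, `interpolate_periodic` and `shift_periodic` are hypothetical and are not taken from the Gysela code.

```python
import math


def lagrange4_weights(t):
    """Cubic Lagrange weights for nodes at offsets -1, 0, 1, 2.

    t is the fractional position inside the cell, expected in [0, 1).
    The four weights always sum to 1.
    """
    return (
        -t * (t - 1.0) * (t - 2.0) / 6.0,
        (t + 1.0) * (t - 1.0) * (t - 2.0) / 2.0,
        -(t + 1.0) * t * (t - 2.0) / 2.0,
        (t + 1.0) * t * (t - 1.0) / 6.0,
    )


def interpolate_periodic(f, x):
    """Evaluate periodic samples f (assumed grid spacing 1) at position x."""
    n = len(f)
    i = math.floor(x)           # index of the grid point left of x
    w = lagrange4_weights(x - i)
    # 4-point stencil i-1 .. i+2, wrapped around the periodic domain
    return sum(w[k] * f[(i - 1 + k) % n] for k in range(4))


def shift_periodic(f, displacement):
    """Semi-Lagrangian style update: new value at j is the old value at j - displacement."""
    return [interpolate_periodic(f, j - displacement) for j in range(len(f))]
```

Each output point reads four neighbouring inputs and performs a handful of multiply-adds, so a loop of this shape tends to be limited by memory traffic as much as by arithmetic. In a compiled language the loop over `j` in `shift_periodic` is the natural candidate for vectorization, since its iterations are independent.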

Tags: Computational Physics, Intel Xeon Phi, Physics

March 22, 2015 by hgpu
