high performance computing on graphics processing units: hgpu.org

hgpu.org » Applications » Electrodynamics » Performance evaluation of the multi-device OpenCL FDTD solver

Performance evaluation of the multi-device OpenCL FDTD solver

Tomasz P. Stefanski, Nicolas Chavannes, Niels Kuster

ETH Zurich, Integrated Systems Laboratory, Gloriastrasse 35, 8092, Switzerland

Proceedings of the 5th European Conference on Antennas and Propagation (EUCAP), 2011

@inproceedings{stefanskiperformance,

title={Performance evaluation of the multi-device OpenCL FDTD solver},

author={Stefanski, T.P. and Chavannes, N. and Kuster, N.},

booktitle={Antennas and Propagation (EUCAP), Proceedings of the 5th European Conference on},

pages={3995–3998},

organization={IEEE},

year={2011}

}

Source

2365

views

We present results of an evaluation of a multi-device OpenCL FDTD solver. Portability between hardware manufactured by different vendors and also between highly specialized and parallel computing architectures available on the market, i.e. GPUs, multi-core CPUs and devices integrating both technologies in a single-die IC, is the main advantage of this solver. For code execution on GPUs, the computational domain is decomposed along the slowest direction, and electromagnetic field boundary data is shared between neighboring subdomains. The communication overhead between GPUs is proportional to the area of the boundary and represents the rate-limiting step of the method. Utilized hardware devices allow the communication overhead to be hidden by computations for sufficiently large simulation domains, giving a scaling efficiency higher than 90%. CPUs placed in different sockets on a motherboard are visible by the OpenCL driver as a single computing device with an aggregated number of cores, thus decomposition of the domain is not necessary for solver execution on multi-core CPUs. The paper subsequently shows results of numerical tests aimed at evaluation of the developed code in realistic simulations of problems in computational electromagnetics.

Tags: Electrodynamics, FDTD, Finite-difference time-domain, GPU cluster, OpenCL

June 21, 2011 by hgpu

No votes yet.

Please wait...

Your response

You must be logged in to post a comment.

high performance computing on graphics processing units: hgpu.org

Performance evaluation of the multi-device OpenCL FDTD solver

Your response

Recent source codes

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

Probe-and-Refine Tuning of Repository Guidance for AI Coding Agents

CUDAnalyst (CUDA + Analyst)

CodegenBench

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CUDA Kernel Fusion Benchmarks

IntelliKit: Agent-first tooling for AMD hardware

DITRON: Distributed Compiler based on Triton for Parallel Systems

CuTile Benchmark Suite: Performance and Productivity Tradeoffs for GPU Kernel Programming on Blackwell Architecture

Agentic Code Optimization via Compiler-LLM Cooperation

Most viewed papers (last 30 days)

Performance evaluation of the multi-device OpenCL FDTD solver

Share this:

Your response

Recent source codes

Most viewed papers (last 30 days)