Performance analysis of multi-core CPUs and GPU computing on SF-FDTD scheme for third order nonlinear materials and periodic media

hgpu.org » Programming » CUDA » Performance analysis of multi-core CPUs and GPU computing on SF-FDTD scheme for third order nonlinear materials and periodic media

Performance analysis of multi-core CPUs and GPU computing on SF-FDTD scheme for third order nonlinear materials and periodic media

Jorge Frances Monllor, Sergio Bleda Perez, Jani Tervo, Cristian Neipp Lopez, Andres Marquez Ruiz, Inmaculada Pascual Villalobos, Augusto Belendez Vazquez

Departamento de Fisica, Ingenieria de Sistemas y Teoria de la Senal, Universidad de Alicante

13th International Conference on Mathematical Methods in Science and Engineering (CMMSE), 2013

@article{frances2013performance,

title={Performance analysis of multi-core CPUs and GPU computing on SF-FDTD scheme for third order nonlinear materials and periodic media},

author={Franc{‘e}s Monllor, Jorge and Bleda P{‘e}rez, Sergio and Tervo, Jani and Neipp L{‘o}pez, Cristian and M{‘a}rquez Ruiz, Andr{‘e}s and Pascual Villalobos, Inmaculada and Bel{‘e}ndez V{‘a}zquez, Augusto and others},

year={2013},

publisher={CMMSE}

}

Download (PDF)

View

Source

2148

views

The Split-Field Finite-Difference Time-Domain (SF-FDTD) scheme is an optimal formulation for modeling periodic optical media by means of a single unit period. The split-field components and the Periodic Boundary Condition (BPC) in the periodic boundaries allow to obtain successful results even with oblique angle of incidence. Under this situation the standard FDTD scheme requires multiple periods and smaller spatial and time resolutions in order to provide accurate results, thus degrading the computational performance of this method. However, considering nonlinear materials in SF-FDTD requires an iterative fixed-point procedure for solving the nonlinear system of equations. Furthermore, complex notation is needed for the split-field components and for correctly apply PBC. Although SF-FDTD is more appropriate for modeling periodic media than the standard FDTD scheme, the addition of nonlinear media degrades the overall performance of this method. In this work, the SF-FDTD formulation for third-order nonlinear materials is implemented taking into consideration parallel hardware architectures. The influence of the fixed-point procedure has been analyzed in both multi-core Central Processing Units (CPUs) and Graphic Processing Units (GPUs).The results show that highly optimized CPU version is competitive amongst GPU computing. Both optimized versions are not dramatically affected by the iterative process due to massive usage of their inner cache memories.

Tags: CUDA, Electrodynamics, FDTD, FEM, Finite element method, Finite-difference time-domain, nVidia, nVidia GeForce GTX 460, Optics

September 20, 2013 by hgpu

No votes yet.

Please wait...

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

* * *

high performance computing on graphics processing units: hgpu.org