Performance analysis of SSE instructions in multi-core CPUs and GPU computing on FDTD scheme for solid and fluid vibration problems

Jorge Frances Monllor, Sergio Bleda Perez, Andres Marquez Ruiz, Cristian Neipp Lopez, Sergi Gallego Rico, Beatriz Otero Calvino, Augusto Belendez Vazquez

Universidad de Alicante. Departamento de Fisica, Ingenieria de Sistemas y Teoria de la Senal

13th International Conference on Mathematical Methods in Science and Engineering, 2013

@article{frances2013performance,
  title={Performance analysis of SSE instructions in multi-core CPUs and GPU computing on FDTD scheme for solid and fluid vibration problems},
  author={Franc{\'e}s Monllor, Jorge and Bleda P{\'e}rez, Sergio and M{\'a}rquez Ruiz, Andr{\'e}s and Neipp L{\'o}pez, Cristian and Gallego Rico, Sergi and Otero Calvi{\~n}o, Beatriz and Bel{\'e}ndez V{\'a}zquez, Augusto and others},
  year={2013},
  publisher={CMMSE}
}

In this work, a unified treatment of solid and fluid vibration problems is developed by means of the Finite-Difference Time-Domain (FDTD) method. The proposed scheme introduces a scaling factor in the velocity fields that improves the performance of the method and the vibration analysis in heterogeneous media. To accurately reproduce the interaction of fluids and solids in FDTD, both the time step and the spatial step must be smaller than those used in acoustic FDTD problems. This implies larger grids and hence more time and memory resources. To reduce simulation time, the FDTD code has been adapted to exploit the resources available in modern parallel architectures. For CPUs, the implicit use of the Streaming SIMD (Single Instruction, Multiple Data) Extensions (SSE) in multi-core CPUs has been considered. In addition, the computation has been distributed across the available cores by means of OpenMP directives. Graphics Processing Units (GPUs) have also been considered, and the improvement achieved with this parallel architecture has been compared with the highly tuned CPU scheme in terms of relative speedup. The parallel versions were up to 7 and 30 times faster than the best sequential version for CPU and GPU, respectively. The results obtained with both parallel approaches demonstrate that parallel programming techniques are mandatory in solid-vibration problems with FDTD.
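As an illustration of the CPU-side strategy described in the abstract (loops simple enough for the compiler to vectorize with SSE, distributed over cores with OpenMP), the following is a minimal C sketch of one time step of a 1D acoustic velocity-pressure FDTD scheme on a staggered grid. It is not the authors' code: the function name `fdtd1d_step`, the 1D simplification, the rigid-wall boundaries, and the parameter names are assumptions made for illustration, and the velocity scaling factor proposed in the paper is not reproduced here.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: one leapfrog step of a 1D acoustic FDTD scheme.
 * p holds n cell-centered pressures, v holds n + 1 face velocities.
 * v[0] and v[n] are never updated, which models rigid walls when they
 * are initialized to zero (an assumption of this sketch). */
void fdtd1d_step(float *restrict p, float *restrict v, size_t n,
                 float rho, float kappa, float dt, float dx)
{
    assert(n >= 2);

    const float cv = dt / (rho * dx);   /* velocity update coefficient */
    const float cp = kappa * dt / dx;   /* pressure update coefficient */

    /* dv/dt = -(1/rho) dp/dx at interior faces.
     * "parallel for" splits iterations across cores; "simd" hints that
     * the loop body is safe to vectorize (e.g. with SSE). */
    #pragma omp parallel for simd
    for (size_t i = 1; i < n; ++i)
        v[i] -= cv * (p[i] - p[i - 1]);

    /* dp/dt = -kappa dv/dx at every cell, using the updated velocities. */
    #pragma omp parallel for simd
    for (size_t i = 0; i < n; ++i)
        p[i] -= cp * (v[i + 1] - v[i]);
}
```

With GCC, the pragmas take effect when compiling with `-fopenmp`; without it they are ignored and the loops run sequentially, which gives a convenient baseline for measuring speedup. In this 1D setting the step is stable only if `sqrt(kappa / rho) * dt / dx <= 1`, so halving `dx` also requires halving `dt`, which is one reason finer grids are so costly.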

Tags: CUDA, FDTD, Finite-difference time-domain, Fluid dynamics, nVidia, nVidia GeForce GTX 470, Programming techniques

September 15, 2013 by hgpu
