https://hgpu.org/?p=6971
A High Performance Parallel FDTD Method Enhanced By Using SSE Instruction Set