https://hgpu.org/?p=7397
A High-Performance Parallel FDTD Method Enhanced by Using SSE Instruction Set