https://hgpu.org/?p=5919
Asynchronous Communication for Finite-Difference Simulations on GPU Clusters using CUDA and MPI