https://hgpu.org/?p=24349
Design, Implementation and Test of Efficient GPU to GPU Communication Methods