https://hgpu.org/?p=3094
Multi-GPU Performance of Incompressible Flow Computation by Lattice Boltzmann Method on GPU Cluster