https://hgpu.org/?p=6202
Performance analysis of a hybrid MPI/CUDA implementation of the NASLU benchmark