https://hgpu.org/?p=2524
A Multi-Stage CUDA Kernel for Floyd-Warshall