https://hgpu.org/?p=8742
Performance comparison of Gauss-Jordan elimination method using OpenMP and CUDA