https://hgpu.org/?p=7442
An implementation of the tile QR factorization for a GPU and multiple CPUs