https://hgpu.org/?p=4596
High Performance Matrix Inversion on a Multi-core Platform with Several GPUs