https://hgpu.org/?p=1288
Towards Dense Linear Algebra for Hybrid GPU Accelerated Manycore Systems