https://hgpu.org/?p=18098
Optimization of Hierarchical Matrix Computation on GPU