https://hgpu.org/?p=12465
Improving Performance and Energy Consumption of Runtime Schedulers for Dense Linear Algebra