https://hgpu.org/?p=20105
Fireiron: A Scheduling Language for High-Performance Linear Algebra on GPUs