https://hgpu.org/?p=14133
Generating Efficient Tensor Contractions for GPUs