https://hgpu.org/?p=27217
Sgap: Towards Efficient Sparse Tensor Algebra Compilation for GPU