https://hgpu.org/?p=28101
PopSparse: Accelerated block sparse matrix multiplication on IPU