https://hgpu.org/?p=25516
High Performance GPU Code Generation for Matrix-Matrix Multiplication using MLIR: Some Early Results