https://hgpu.org/?p=18049
NVIDIA Tensor Core Programmability, Performance & Precision