https://hgpu.org/?p=26720
Analytical Performance Estimation during Code Generation on Modern GPUs