https://hgpu.org/?p=18931
Diagnosing Performance Bottlenecks in HPC Applications