https://hgpu.org/?p=15070
A Semi-Automated Tool Flow for Roofline Anaylsis of OpenCL Kernels on Accelerators