https://hgpu.org/?p=17359
DeepProf: Performance Analysis for Deep Learning Applications via Mining GPU Execution Patterns