TEG: GPU Performance Estimation Using a Timing Model

Junjie Lai, Andre Seznec
Project-Team ALF
Research Report no 7804, 2011




   title={TEG: GPU Performance Estimation Using a Timing Model},

   author={Lai, Junjie and Seznec, Andr{‘e}},


   affiliation={ALF – INRIA – IRISA – INRIA – Universit{‘e} de Rennes I},

   type={Rapport de recherche},







Download Download (PDF)   View View   Source Source   



Modern Graphic Processing Units (GPUs) offer significant performance speedup over conventional processors. Programming on GPU for general purpose applications has become an important research area. CUDA programming model provides a C-like interface and is widely accepted. However, since hardware vendors do not disclose enough underlying architecture details, programmers have to optimize their applications without fully understanding the performance characteristics. In this paper we present a GPU timing model to provide more insights into the applications’ performance on GPU. A GPU CUDA program timing estimation tool (TEG) is developed based on the GPU timing model. Especially, TEG illustrates how performance scales from one warp (CUDA thread group) to multiple concurrent warps on SM (Streaming Multiprocessor). Because TEG takes the native GPU assembly code as input, it allows to estimate the execution time with only a small error. TEG can help programmers to better understand the performance results and quantify bottlenecks’ performance effects.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: