HPerf: A Lightweight Profiler for Task Distribution on CPU+GPU Platforms

Joo Hwan Lee, Nimit Nigania, Hyesoon Kim, Bevin Brett
School of Computer Science, Georgia Institute of Technology
SCS Technical Report GT-CS-15-04, 2015







Heterogeneous computing has emerged as one of the major computing platforms in many domains. Although several proposals aim to ease programming for heterogeneous platforms, optimizing applications on them remains difficult. One of the critical decisions for performance is identifying which parallel regions (or tasks) should run on GPUs and which on CPUs. In this paper, we propose a profiler, HPerf, that identifies an efficient task distribution on CPU+GPU systems with low profiling overhead. HPerf is a hierarchical profiler: it first performs lightweight profiling and then, only if necessary, performs detailed profiling to measure caching and data transfer costs. Compared to a brute-force approach, HPerf significantly reduces profiling overhead, and compared to a naive decision, it improves the performance of OpenCL applications by up to 25%.
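
The two-level idea in the abstract can be illustrated with a minimal sketch. This is not HPerf's actual algorithm, which the abstract does not specify. The function `choose_device`, the `DetailedCost` record, and the `margin` threshold are all hypothetical names and values introduced here for illustration. The sketch assumes that a cheap timing of each device is available, and that the expensive measurement (caching and data transfer cost) is requested only when the cheap timings are too close to separate.

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass
class DetailedCost:
    """Hypothetical result of the expensive second profiling stage."""
    cpu_time: float       # seconds, CPU run including cache effects
    gpu_time: float       # seconds, GPU kernel time only
    transfer_time: float  # seconds, host<->device data transfer


def choose_device(
    light_cpu: float,
    light_gpu: float,
    detailed_profile: Callable[[], DetailedCost],
    margin: float = 0.2,
) -> Tuple[str, bool]:
    """Return (device, used_detailed_stage) for one task.

    `margin` is an assumed relative gap: when the lightweight timings
    differ by more than this fraction, the cheap result is trusted.
    """
    slower = max(light_cpu, light_gpu)
    gap = abs(light_cpu - light_gpu) / slower if slower > 0 else 0.0
    if gap > margin:
        # Clear winner from lightweight profiling; skip the costly stage.
        return ("cpu" if light_cpu < light_gpu else "gpu", False)

    # Too close to call: pay for detailed profiling and charge the GPU
    # for the data transfer it requires.
    cost = detailed_profile()
    gpu_total = cost.gpu_time + cost.transfer_time
    return ("cpu" if cost.cpu_time <= gpu_total else "gpu", True)
```

In this sketch the detailed stage is passed as a callable so that its cost is incurred only on the ambiguous path, which mirrors the "only if necessary" wording of the abstract.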