HPerf: A Lightweight Profiler for Task Distribution on CPU+GPU Platforms
School of Computer Science, Georgia Institute of Technology
SCS Technical Report GT-CS-15-04, 2015
@article{lee2015hperf,
title={HPerf: A Lightweight Profiler for Task Distribution on CPU+ GPU Platforms},
author={Lee, Joo Hwan and Nigania, Nimit and Kim, Hyesoon and Brett, Bevin},
year={2015},
publisher={Georgia Institute of Technology}
}
Heterogeneous computing has emerged as one of the major computing platforms in many domains. Although there have been several proposals to aid programming for heterogeneous computing platforms, optimizing applications on heterogeneous computing platforms is not an easy task. Identifying which parallel regions (or tasks) should run on GPUs or CPUs is one of the critical decisions to improve performance. In this paper, we propose a profiler, HPerf, to identify an efficient task distribution on CPUs+GPUs system with low profiling overhead. HPerf is a hierarchical profiler. First it performs lightweight profiling and then if necessary, it performs detailed profiling to measure caching and data transfer cost. Compared to a brute-force approach, HPerf reduces the profiling overhead significantly and compared to a naive decision, HPerf improves the performance of OpenCL applications up to 25%.
March 8, 2015 by hgpu