Qilin: exploiting parallelism on heterogeneous multiprocessors with adaptive mapping
Software Pathfinding and Innovations, Software and Services Group, Intel Corporation, Hudson, MA 01749
42nd Annual IEEE/ACM International Symposium on Microarchitecture, 2009. MICRO-42, p.45-55
@conference{luk2010qilin,
title={Qilin: exploiting parallelism on heterogeneous multiprocessors with adaptive mapping},
author={Luk, C.K. and Hong, S. and Kim, H.},
booktitle={Microarchitecture, 2009. MICRO-42. 42nd Annual IEEE/ACM International Symposium on},
pages={45–55},
issn={1072-4451},
year={2010},
organization={IEEE}
}
Heterogeneous multiprocessors are increasingly important in the multi-core era due to their potential for high performance and energy efficiency. In order for software to fully realize this potential, the step that maps computations to processing elements must be as automated as possible. However, the state-of-the-art approach is to rely on the programmer to specify this mapping manually and statically. This approach is not only labor intensive but also not adaptable to changes in runtime environments like problem sizes and hardware/software configurations. In this study, we propose adaptive mapping, a fully automatic technique to map computations to processing elements on a CPU+GPU machine. We have implemented it in our experimental heterogeneous programming system called Qilin. Our results show that, by judiciously distributing works over the CPU and GPU, automatic adaptive mapping achieves a 25% reduction in execution time and a 20% reduction in energy consumption than static mappings on average for a set of important computation benchmarks. We also demonstrate that our technique is able to adapt to changes in the input problem size and system configuration.
January 12, 2011 by hgpu