Power-performance comparison of single-task driven many-cores
Department of Electrical and Computer Engineering, University of Maryland
IEEE 17th International Conference on Parallel and Distributed Systems (ICPADS), 2011
@inproceedings{keceli2011power,
title={Power-performance comparison of single-task driven many-cores},
author={Keceli, F. and Moreshet, T. and Vishkin, U.},
booktitle={2011 IEEE 17th International Conference on Parallel and Distributed Systems},
pages={348--355},
year={2011},
organization={IEEE}
}
Many-cores, processors with hundreds of cores, are becoming increasingly popular in general-purpose computing, yet power is a limiting factor in their performance. In this paper, we compare the power and performance of two design points in the many-core processor domain. The XMT general-purpose processor provides significant runtime advantage on irregular parallel programs (e.g., graph algorithms). This was previously demonstrated and tied to its architecture choices and ease of programming. In contrast, current commercial GPUs excel at regular parallel programs that require high processing capability. In this work, we set the power envelope as a constraint and evaluate an envisioned 1024-core XMT processor against an NVIDIA GTX280 GPU, considering various scenarios for estimating the power of the XMT chip. Even under worst-case assumptions and scenarios, simulations show that the XMT processor sustains its advantage over the GPU on irregular parallel programs, while not falling significantly behind on regular programs. The total energy spent per benchmark fits a similar pattern. Given that the two architectures target different types of parallelism, a future system can potentially utilize an XMT chip and a GPU chip in complementary roles.
January 11, 2012 by hgpu