A Research of MapReduce with GPU Acceleration
School of Information Science and Engineering, Yunnan University, Kunming, Yunnan, China
18th PDPTA, 2012
@article{xin2012research,
title={A Research of MapReduce with GPU Acceleration},
author={Xin, M. and Li, H. and Lu, J.},
year={2012}
}
MapReduce is an efficient distributed computing model on large data sets. The data processing is fully distributed on huge amount of nodes, and a MapReduce cluster is of highly scalable. However, single-node performance is gradually to be a bottleneck in computeintensive jobs, which makes it difficult to extend the MapReduce model to wider application fields such as largescale image processing and image mining. As an attempt, this paper presents an approach of GPU-accelerated MapReduce, which is implemented by Hadoop and OpenCL. Being a distinctive feature, it aims at general and inexpensive hardware platform, and it is seamlessly integrated with Apache Hadoop, the most widely used MapReduce framework. As a heterogeneous multi-machine and many-core architecture, it targets at both data- and compute-intensive applications. An almost 2 times performance improvement has been validated, without any special optimization.
August 28, 2012 by hgpu