A Distributed Data Mining Framework Accelerated with Graphics Processing Units

Nam-Luc Tran, Quentin Dugauthier, Sabri Skhiri
Euranova R&D, Belgium
International Conference on Cloud Computing and Big Data (CloudCom-Asia), 2013


   title={A Distributed Data Mining Framework Accelerated with Graphics Processing Units},

   author={Tran, Nam-Luc and Dugauthier, Quentin and Skhiri, Sabri},



Download Download (PDF)   View View   Source Source   



In the context of processing high volumes of data, the recent developments have led to numerous models and frameworks of distributed processing running on clusters of commodity hardware. On the other side, the Graphics Processing Unit (GPU) has seen much enthusiastic development as a device for general-purpose intensive parallel computation. In this paper we propose a framework which combines both approaches and evaluates the relevance of having nodes in a distributed processing cluster that make use of GPU units for further fine-grained parallel processing. We have engineered parallel and distributed versions of two data mining problems, the naive Bayes classifier and the k-means clustering algorithm, to run on the framework and have evaluated the performance gain. Finally, we also discuss the requirements and perspectives of integrating GPUs in a distributed processing cluster, introducing a fully distributed heterogeneous computing cluster.
Rating: 2.4/5. From 4 votes.
Please wait...

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: