Performance benchmarking of deep learning framework on Intel Xeon Phi
Department of Computer Science, Tunghai University, Taichung 40704, Taiwan, ROC
The Journal of Supercomputing (2020)
@article{yang2020performance,
title={Performance benchmarking of deep learning framework on Intel Xeon Phi},
author={Yang, Chao-Tung and Liu, Jung-Chun and Chan, Yu-Wei and Kristiani, Endah and Kuo, Chan-Fu},
journal={The Journal of Supercomputing},
pages={1--25},
year={2020},
publisher={Springer}
}
With the success of deep learning (DL) methods across diverse application domains, several DL software frameworks have been proposed to make these methods easier to use. Understanding how these frameworks behave on big data workloads allows analyses to be carried out more efficiently in terms of both time and accuracy, so benchmarking DL software frameworks is in high demand. This paper presents a comparative study of two deep learning frameworks, Caffe and TensorFlow, on two performance metrics: runtime and accuracy. The study uses several workloads, including the LeNet model on the MNIST classification dataset, the CIFAR-10 image recognition dataset, and a message passing interface (MPI) parallel matrix-vector multiplication benchmark. We evaluate the performance of these frameworks on machines equipped with the Intel Xeon Phi 7210 processor and examine how vectorization, OpenMP parallel processing, and MPI can be used to improve their performance. The experimental results compare accuracy against the number of training iterations and report training times on the different machines before and after optimization. In addition, an experiment on a two-node Xeon Phi cluster is performed. The results show that optimization for the Xeon Phi benefits both the Caffe and TensorFlow frameworks.
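As a rough illustration of the kind of hybrid kernel the MPI matrix-vector multiplication benchmark refers to, the minimal sketch below shows how rows of the matrix can be split across MPI ranks while OpenMP threads and compiler vectorization handle the per-rank work. The paper's actual benchmark code is not reproduced here; the matrix size, row-block distribution, and pragma choices are assumptions for illustration only.

// Minimal sketch of an MPI + OpenMP parallel matrix-vector multiplication.
// Matrix size, data layout, and distribution are assumptions, not the
// authors' benchmark code.
#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>

#define N 4096  /* assumed square matrix dimension, divisible by the rank count */

int main(int argc, char **argv) {
    int rank, size;
    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    int rows = N / size;                           /* rows owned by this rank */
    double *A = malloc((size_t)rows * N * sizeof(double));
    double *x = malloc((size_t)N * sizeof(double));
    double *y = malloc((size_t)rows * sizeof(double));

    /* Fill the local block and the shared vector with simple values. */
    for (int i = 0; i < rows * N; i++) A[i] = 1.0;
    for (int j = 0; j < N; j++) x[j] = 1.0;

    double t0 = MPI_Wtime();

    /* Each rank computes its block of y = A * x; OpenMP threads split the
       rows, and the inner loop is a candidate for SIMD vectorization. */
    #pragma omp parallel for
    for (int i = 0; i < rows; i++) {
        double sum = 0.0;
        #pragma omp simd reduction(+:sum)
        for (int j = 0; j < N; j++)
            sum += A[(size_t)i * N + j] * x[j];
        y[i] = sum;
    }

    /* Gather the distributed result vector on rank 0. */
    double *y_full = NULL;
    if (rank == 0) y_full = malloc((size_t)N * sizeof(double));
    MPI_Gather(y, rows, MPI_DOUBLE, y_full, rows, MPI_DOUBLE, 0, MPI_COMM_WORLD);

    double t1 = MPI_Wtime();
    if (rank == 0)
        printf("y[0] = %.1f, elapsed = %.4f s\n", y_full[0], t1 - t0);

    free(A); free(x); free(y); free(y_full);
    MPI_Finalize();
    return 0;
}

A sketch like this would typically be built with an MPI compiler wrapper and OpenMP enabled (for example, mpicc -O3 -fopenmp) and launched with mpirun across the Xeon Phi nodes; thread count and vectorization behavior are what the optimization study above varies.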
June 28, 2020 by hgpu