A Comparison of GPU Execution Time Prediction using Machine Learning and Analytical Modeling

Marcos Amaris, Raphael Y. de Camargo, Mohamed Dyab, Alfredo Goldman, Denis Trystram

Institute of Mathematics and Statistics, University of Sao Paulo, Sao Paulo, Brazil

15th IEEE International Symposium on Network Computing and Applications, 2016

Package: svm-gpuperf: Machine Learning to Predict Performance in GPU Applications

Today, most high-performance computing (HPC) platforms have heterogeneous hardware resources (CPUs, GPUs, storage, etc.). A Graphics Processing Unit (GPU) is a parallel computing coprocessor specialized in accelerating vector operations. Predicting application execution times on these devices is a great challenge and is essential for efficient job scheduling. There are different approaches to this problem, such as analytical modeling and machine learning techniques. Analytical predictive models are useful, but require manual inclusion of interactions between architecture and software, and may not capture the complex interactions in GPU architectures. Machine learning techniques can learn to capture these interactions without manual intervention, but may require large training sets. In this paper, we compare three machine learning approaches (linear regression, support vector machines and random forests) with a BSP-based analytical model for predicting the execution time of GPU applications. As input to the machine learning algorithms, we use profiling information from 9 applications executed on 9 different GPUs. We show that machine learning approaches provide reasonable predictions for different cases. Although the predictions were inferior to those of the analytical model, they required no detailed knowledge of application code, hardware characteristics or explicit modeling. Consequently, whenever a database with profile information is available or can be generated, machine learning techniques can be useful for deploying automated on-line performance prediction for scheduling applications on heterogeneous architectures containing GPUs.
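To make the machine learning side of the comparison concrete, here is a minimal sketch of the simplest of the three approaches, linear regression, fitted to profile data to predict execution time. It is not the authors' code or the svm-gpuperf package: the single feature (global memory transactions), the sample values, and the names `fit_line` and `predict` are all hypothetical, chosen only to illustrate the idea.

```python
from statistics import mean

def fit_line(xs, ys):
    """Ordinary least squares for y = a*x + b, using the closed-form solution."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    a = sxy / sxx
    b = my - a * mx
    return a, b

def predict(model, x):
    """Predicted execution time for feature value x."""
    a, b = model
    return a * x + b

# Hypothetical profile rows (not from the paper):
# (global memory transactions in millions, measured kernel time in ms)
train = [(1.0, 2.1), (2.0, 3.9), (4.0, 8.2), (8.0, 15.8)]
test = [(3.0, 6.0), (6.0, 12.1)]

model = fit_line([x for x, _ in train], [y for _, y in train])

# Accuracy expressed as predicted / measured time; 1.0 is a perfect prediction.
ratios = [predict(model, x) / t for x, t in test]
for (x, t), r in zip(test, ratios):
    print(f"feature={x:.1f} measured={t:.1f} ms predicted/measured={r:.2f}")
```

A real setup would use many profile features per kernel and per GPU, and would hold out whole applications or devices for testing. The point here is only that the model is fitted from measured profiles, with no knowledge of the application code.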

Tags: Analytical model, Benchmarking, Computer science, Heterogeneous systems, Machine learning, nVidia, nVidia GeForce GTX 680, nVidia GeForce GTX 970, nVidia GeForce GTX 980, nVidia GeForce GTX Titan, nVidia GeForce GTX Titan Black, nVidia GeForce GTX Titan X, nVidia Quadro K5200, Package, Performance, Tesla K20, Tesla K40

December 6, 2016 by hgpu
