high performance computing on graphics processing units: hgpu.org

hgpu.org » Programming » Algorithms » Reusing Auto-Schedules for Efficient DNN Compilation

Reusing Auto-Schedules for Efficient DNN Compilation

Perry Gibson, José Cano

School of Computing Science, University of Glasgow, UK

arXiv:2201.05587 [cs.LG], (14 Jan 2022)

@misc{gibson2022reusing,

title={Reusing Auto-Schedules for Efficient DNN Compilation},

author={Perry Gibson and José Cano},

year={2022},

eprint={2201.05587},

archivePrefix={arXiv},

primaryClass={cs.LG}

}

Download (PDF)

View

Source

1219

views

Auto-scheduling is a process where a search algorithm automatically explores candidate schedules (program transformations) for a given tensor program on a given hardware platform to improve its performance. However this can be a very time consuming process, depending on the complexity of the tensor program, and capacity of the target device, with often many thousands of program variants being explored. To address this, in this paper we introduce and demonstrate the idea of tuning-reuse, a novel approach to identify and re-use auto-schedules between tensor programs. We demonstrate this concept using Deep Neural Networks (DNNs), taking sets of auto-schedules from pre-tuned DNNs, and using them to reduce the inference time of a new DNN. Given a set of pre-tuned schedules, tuning-reuse provides its maximum speedup in less time than auto-scheduling using the state-of-the-art Ansor auto-scheduler. On a set of widely used DNN models, we apply tuning-reuse and achieve maximum speedups between 1.16x and 4.76x, while outperforming Ansor when given limited tuning time.

Tags: Algorithms, Computer science, Machine learning, Neural networks, Performance, Programming Languages

January 23, 2022 by hgpu

No votes yet.

Please wait...