19942

ADWPNAS: Architecture-Driven Weight Prediction for Neural Architecture Search

Xu Zhang, Chenjun Zhou, Bo Gu
SYSU
arXiv:2003.01335 [cs.NE], (3 Mar 2020)

@misc{xuzhang2020adwpnas,

   title={ADWPNAS: Architecture-Driven Weight Prediction for Neural Architecture Search},

   author={XuZhang and ChenjunZhou and BoGu},

   year={2020},

   eprint={2003.01335},

   archivePrefix={arXiv},

   primaryClass={cs.NE}

}

Download Download (PDF)   View View   Source Source   

1157

views

How to discover and evaluate the true strength of models quickly and accurately is one of the key challenges in Neural Architecture Search (NAS). To cope with this problem, we propose an Architecture-Driven Weight Prediction (ADWP) approach for neural architecture search (NAS). In our approach, we first design an architecture-intensive search space and then train a HyperNetwork by inputting stochastic encoding architecture parameters. In the trained HyperNetwork, weights of convolution kernels can be well predicted for neural architectures in the search space. Consequently, the target architectures can be evaluated efficiently without any finetuning, thus enabling us to search fortheoptimalarchitectureinthespaceofgeneralnetworks (macro-search). Through real experiments, we evaluate the performance of the models discovered by the proposed AD-WPNAS and results show that one search procedure can be completed in 4.0 GPU hours on CIFAR-10. Moreover, the discovered model obtains a test error of 2.41% with only 1.52M parameters which is superior to the best existing models.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: