high performance computing on graphics processing units: hgpu.org


Dynamic Distribution Pruning for Efficient Network Architecture Search

Xiawu Zheng, Rongrong Ji, Lang Tang, Yan Wan, Baochang Zhang, Yongjian Wu, Yunsheng Wu, Ling Shao

Fujian Key Laboratory of Sensing and Computing for Smart City, Department of Cognitive Science, School of Information Science and Engineering, Xiamen University, Xiamen, China

arXiv:1905.13543 [cs.CV], (28 May 2019)

@misc{zheng2019dynamic,
  title={Dynamic Distribution Pruning for Efficient Network Architecture Search},
  author={Xiawu Zheng and Rongrong Ji and Lang Tang and Yan Wan and Baochang Zhang and Yongjian Wu and Yunsheng Wu and Ling Shao},
  year={2019},
  eprint={1905.13543},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}


Network architectures obtained by Neural Architecture Search (NAS) have shown state-of-the-art performance in various computer vision tasks. Despite this progress, the computational cost of forward-backward propagation and of the search process itself makes NAS difficult to apply in practice. In particular, most previous methods require thousands of GPU days for the search process to converge. In this paper, we propose a dynamic distribution pruning method for extremely efficient NAS, which samples architectures from a joint categorical distribution. The search space is dynamically pruned every few epochs to update this distribution, and the optimal neural architecture is obtained when only one structure remains. We conduct experiments on two datasets widely used in NAS. On CIFAR-10, the optimal structure obtained by our method achieves a state-of-the-art 1.9% test error, while the search process is more than 1,000 times faster (only 1.5 GPU hours on a Tesla V100) than state-of-the-art NAS algorithms. On ImageNet, our model achieves 75.2% top-1 accuracy under the MobileNet settings, with a time cost of only 2 GPU days, a 100% acceleration over the fastest NAS algorithm. The code is available.
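The sample-then-prune loop described in the abstract can be illustrated with a small toy sketch. This is not the authors' implementation: the function name `prune_search`, the `evaluate` callback, the `samples_per_round` parameter, and the pruning rule (drop the operation with the lowest mean score on each edge, then renormalize) are all assumptions made for illustration, since the abstract does not specify how the distribution is updated.

```python
import random


def prune_search(num_edges, num_ops, evaluate, samples_per_round=8, seed=0):
    """Toy sketch of sampling from a joint categorical distribution and
    pruning it until one architecture remains (hypothetical update rule)."""
    rng = random.Random(seed)
    # probs[e] maps each surviving op on edge e to its sampling probability.
    # The joint distribution is assumed to factorize over edges.
    probs = [{op: 1.0 / num_ops for op in range(num_ops)}
             for _ in range(num_edges)]

    while any(len(p) > 1 for p in probs):
        totals = [dict.fromkeys(p, 0.0) for p in probs]
        counts = [dict.fromkeys(p, 0) for p in probs]

        # Stand-in for "a few epochs": sample architectures and score them.
        for _ in range(samples_per_round):
            arch = [rng.choices(list(p), weights=list(p.values()))[0]
                    for p in probs]
            score = evaluate(arch)
            for e, op in enumerate(arch):
                totals[e][op] += score
                counts[e][op] += 1

        # Prune: on each edge, drop the sampled op with the lowest mean score.
        for e, p in enumerate(probs):
            seen = [op for op in p if counts[e][op] > 0]
            if len(seen) < 2:
                continue  # nothing to compare against this round
            worst = min(seen, key=lambda op: totals[e][op] / counts[e][op])
            del p[worst]
            norm = sum(p.values())
            for op in p:
                p[op] /= norm  # renormalize the surviving probabilities

    # Exactly one op is left on every edge: that is the final architecture.
    return [next(iter(p)) for p in probs]


# Toy usage: one edge whose score equals the op index, so op 4 must survive.
print(prune_search(1, 5, lambda arch: float(arch[0])))  # prints [4]
```

In a real search, `evaluate` would train and validate the sampled network, which is where nearly all of the GPU time goes; shrinking the candidate set each round is what reduces the number of such evaluations.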

Tags: Computer science, Computer vision, CUDA, Deep learning, Machine learning, nVidia, Package, Tesla V100

June 5, 2019 by hgpu
