2018
DOI: 10.48550/arxiv.1802.03268
Preprint

Efficient Neural Architecture Search via Parameter Sharing

Abstract: We propose Efficient Neural Architecture Search (ENAS), a fast and inexpensive approach for automatic model design. In ENAS, a controller discovers neural network architectures by searching for an optimal subgraph within a large computational graph. The controller is trained with policy gradient to select a subgraph that maximizes the expected reward on a validation set. Meanwhile the model corresponding to the selected subgraph is trained to minimize a canonical cross entropy loss. Sharing parameters among ch…
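
The alternating optimization described in the abstract (a REINFORCE-trained controller that samples subgraphs, and shared child-model weights trained with cross-entropy) can be sketched roughly as follows. This is a minimal illustration under assumed interfaces, not the authors' implementation: `SharedModel`, `Controller`, `train_iter`, `valid_iter`, and the hyperparameters are hypothetical placeholders.

```python
# Minimal sketch of ENAS-style alternating training, assuming hypothetical
# SharedModel (weights of the large computational graph), Controller (an RNN
# policy that samples subgraphs), and train_iter/valid_iter data iterators.
import torch
import torch.nn.functional as F

shared = SharedModel()        # shared parameters of the big graph (hypothetical)
controller = Controller()     # policy over subgraphs (hypothetical)
w_opt = torch.optim.SGD(shared.parameters(), lr=0.05)
c_opt = torch.optim.Adam(controller.parameters(), lr=3.5e-4)
baseline = 0.0                # moving-average baseline to reduce variance

for step in range(10000):
    # (1) Train the shared weights on the training set with a sampled subgraph.
    arch, _ = controller.sample()                     # discrete architecture
    x, y = next(train_iter)
    loss = F.cross_entropy(shared(x, arch), y)        # canonical cross-entropy loss
    w_opt.zero_grad(); loss.backward(); w_opt.step()

    # (2) Train the controller with REINFORCE on the validation reward.
    arch, log_prob = controller.sample()
    with torch.no_grad():
        xv, yv = next(valid_iter)
        reward = (shared(xv, arch).argmax(-1) == yv).float().mean().item()
    policy_loss = -log_prob * (reward - baseline)     # policy-gradient estimator
    c_opt.zero_grad(); policy_loss.backward(); c_opt.step()
    baseline = 0.95 * baseline + 0.05 * reward
```

The key point from the abstract is that both updates reuse the same shared parameters, so no child model is ever trained from scratch.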

Cited by 282 publications (593 citation statements)
References 11 publications

Citation statements (ordered by relevance):
“…For example, searching an image model for CIFAR-10 and ImageNet required 2000 GPU days of reinforcement learning (RL) [46] or 3150 GPU days of evolution [28]. ENAS [24] introduced a parameter-sharing strategy to reduce the search time. Recent differentiable NAS (DNAS) methods [20] introduced the softmax-based continuous relaxation of the architecture representation, allowing efficient search using gradient descent.…”
Section: Neural Architecture Search Methods
confidence: 99%
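
The softmax-based continuous relaxation mentioned in the statement above (as used by differentiable NAS methods such as DARTS) can be illustrated with a small sketch. The candidate operation list and the `alpha` architecture parameters below are illustrative assumptions, not code from any cited paper.

```python
# Sketch of a DARTS-style mixed operation: candidate ops on an edge are blended
# with softmax weights over learnable architecture parameters alpha, so the
# architecture can be optimized by gradient descent alongside the weights.
import torch
import torch.nn as nn

class MixedOp(nn.Module):
    def __init__(self, channels):
        super().__init__()
        # Illustrative candidate operations for one edge of the cell.
        self.ops = nn.ModuleList([
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.Conv2d(channels, channels, 5, padding=2),
            nn.MaxPool2d(3, stride=1, padding=1),
            nn.Identity(),
        ])
        # One architecture parameter per candidate op.
        self.alpha = nn.Parameter(torch.zeros(len(self.ops)))

    def forward(self, x):
        weights = torch.softmax(self.alpha, dim=0)      # continuous relaxation
        return sum(w * op(x) for w, op in zip(weights, self.ops))
```

Because the output is a softmax-weighted sum of all candidate ops, the architecture parameters receive gradients and can be optimized jointly with the network weights.
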
“…Recently, machine-designed architectures by Neural Architecture Search (NAS) have surpassed the human-designed ones for image recognition [28,24,33,44]. For video action recognition, the latest work X3D [9] placed a new milestone in this line: it progressively expanded a hand-crafted 2D architecture into 3D spatial-temporal ones, by expanding along multiple axes, including space, time, width, and depth.…”
Section: Introduction
confidence: 99%
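
The progressive expansion attributed to X3D in the statement above can be sketched as a greedy loop over expansion axes. This is only a rough, hedged illustration: the starting configuration, expansion factors, and the `flops`, `train_and_eval`, and `target_flops` helpers are hypothetical placeholders, not X3D's actual schedule.

```python
# Rough sketch of X3D-style progressive expansion: starting from a small 2D
# backbone configuration, repeatedly expand the single axis (space, time,
# width, or depth) that gives the best accuracy, until a compute budget is hit.
# flops(), train_and_eval(), target_flops, and all numbers are hypothetical.
config = {"frames": 1, "resolution": 112, "width": 24, "depth": 13}
factors = {"frames": 2.0, "resolution": 1.3, "width": 2.0, "depth": 2.2}

while flops(config) < target_flops:
    candidates = []
    for axis, gamma in factors.items():
        trial = dict(config)
        trial[axis] = round(trial[axis] * gamma)        # expand one axis at a time
        candidates.append((train_and_eval(trial), trial))
    config = max(candidates, key=lambda c: c[0])[1]     # keep the best expansion
```
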
“…In this paper, we seek to answer the following questions in the context of iterative structured pruning with rewinding: (Han, Mao, and Dally 2015; Kadetotad et al. 2016), knowledge distillation (Polino, Pascanu, and Alistarh 2018; Yim et al. 2017), neural architecture search (Zoph and Le 2016; Pham et al. 2018) and pruning (Li et al. 2016; Han, Mao, and Dally 2015; Srinivas and Babu 2015; Molchanov et al. 2016). There has also been substantial work in manually designing new model topologies, like MobileNet (Howard et al. 2017) and EfficientNet (Tan and Le 2019), that are suitable for edge-device deployment but are less accurate compared to traditional models like ResNet (He et al. 2016).…”
Section: Our Solution
confidence: 99%
“…As deep learning becomes pervasive and moves towards edge devices, DNN deployment becomes harder because of the mismatch between resource-hungry DNNs and resource-constrained edge devices (Li, Zhou, and Chen 2018; Li et al. 2019). Deep learning researchers and practitioners have proposed many techniques to alleviate this resource pressure (Chu, Funderlic, and Plemmons 2003; Han, Mao, and Dally 2015; Polino, Pascanu, and Alistarh 2018; Yim et al. 2017; Pham et al. 2018). Among these efforts, DNN pruning is a promising approach (Li et al. 2016; Han, Mao, and Dally 2015; Molchanov et al. 2016; Theis et al. 2018; Renda, Frankle, and Carbin 2020), which identifies the parameters (or weight elements) that do not contribute significantly to the accuracy, and prunes them from the network.…”
Section: Introduction
confidence: 99%
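
The pruning idea described above, removing parameters that contribute little to accuracy, can be illustrated with a minimal global magnitude-pruning sketch. Using weight magnitude as the saliency criterion is just one common proxy for "contribution"; it is not the specific method of any paper cited in the statement.

```python
# Minimal sketch of global magnitude pruning: weights whose absolute value
# falls below a global percentile threshold are zeroed out via a mask.
import torch
import torch.nn as nn

def magnitude_prune(model: nn.Module, sparsity: float = 0.8):
    """Zero out the smallest-magnitude weights across all Linear/Conv2d layers."""
    weights = torch.cat([
        m.weight.detach().abs().flatten()
        for m in model.modules()
        if isinstance(m, (nn.Linear, nn.Conv2d))
    ])
    threshold = torch.quantile(weights, sparsity)       # global magnitude cutoff
    for m in model.modules():
        if isinstance(m, (nn.Linear, nn.Conv2d)):
            mask = (m.weight.detach().abs() >= threshold).float()
            m.weight.data.mul_(mask)                    # prune in place
    return model
```

Iterative pipelines repeat this pruning step and then fine-tune, or rewind and retrain, the surviving weights.
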
“…To conduct NAS, one normally requires a search space, a search algorithm, and a set of training data. Current NAS research mainly focuses on improving the search algorithms [20,34,1,35], designing the search space [21,7,30], reducing the search cost [3,11,19,4,15] and integrating direct metrics with the search process [27,8].…”
Section: Related Work
confidence: 99%
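
The three ingredients listed in the statement above (a search space, a search algorithm, and training data) can be made concrete with a tiny random-search sketch. The search space and the caller-supplied `evaluate` function are illustrative assumptions.

```python
# Tiny illustration of the three NAS ingredients: a discrete search space,
# a search algorithm (here, plain random search), and an evaluation function
# that trains on the data and returns validation accuracy.
import random

SEARCH_SPACE = {
    "depth":      [8, 14, 20],
    "width":      [16, 32, 64],
    "kernel":     [3, 5, 7],
    "activation": ["relu", "swish"],
}

def random_search(num_trials, evaluate):
    """evaluate(arch) is supplied by the caller and returns validation accuracy."""
    best_acc, best_arch = float("-inf"), None
    for _ in range(num_trials):
        arch = {k: random.choice(v) for k, v in SEARCH_SPACE.items()}  # sample
        acc = evaluate(arch)
        if acc > best_acc:
            best_acc, best_arch = acc, arch
    return best_arch, best_acc
```

In practice the evaluation function trains each sampled architecture on the training data and reports validation accuracy; more sophisticated search algorithms (reinforcement learning, evolution, or gradient-based relaxation) replace the random sampling step.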