2019
DOI: 10.48550/arxiv.1903.09900
Preprint

sharpDARTS: Faster and More Accurate Differentiable Architecture Search

Abstract: Neural Architecture Search (NAS) has been a source of dramatic improvements in neural network design, with recent results meeting or exceeding the performance of hand-tuned architectures. However, our understanding of how to represent the search space for neural net architectures and how to search that space efficiently are both still in their infancy. We have performed an in-depth analysis to identify limitations in a widely used search space and a recent architecture search method, Differentiable Architecture…

Cited by 14 publications (22 citation statements)
References 16 publications (46 reference statements)
“…Searching Phase's CO2 emissions: In order to determine the amount of CO2 emitted by the CV-NAS searching phase, we used a methodology inspired by Strubell et al. [48]. We begin by collecting the 45 major CV-NAS papers of the last three years (2018-2020) [11,7,61,13,37,10,42,15,21,55,60,8,56,24,25,31,38,14,59,26,53,17,45,44,46,16,57,20,22,29,36,50,35,27,43,12,51,41,40,19,18,32,58,39,62], finding 157 models. For every model, we extract the Top-1 Accuracy, Parameters, FLOPS, GPU hours and GPU type.…”
Section: Methods and Results (mentioning)
confidence: 99%
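The Strubell-style accounting behind this excerpt turns GPU type and GPU hours into an energy and carbon estimate: power draw × runtime × datacenter overhead (PUE) × grid carbon intensity. A minimal sketch in Python; the function name, the TDP table, and the carbon-intensity constant are our own illustrative assumptions, not values taken from the citing paper:

# Hypothetical sketch of a Strubell et al.-style CO2 estimate for a NAS search.
GPU_TDP_WATTS = {"V100": 300, "RTX 2080 Ti": 250, "P100": 250}  # nameplate TDP per GPU (assumed)
PUE = 1.58               # datacenter power usage effectiveness, the value used by Strubell et al.
KG_CO2_PER_KWH = 0.433   # illustrative average grid carbon intensity (assumed)

def search_phase_co2_kg(gpu_type: str, num_gpus: int, hours: float) -> float:
    """Estimate kg of CO2 emitted by a NAS searching phase."""
    # energy (kWh) = per-GPU power * GPU count * runtime * PUE
    energy_kwh = GPU_TDP_WATTS[gpu_type] * num_gpus * hours * PUE / 1000.0
    return energy_kwh * KG_CO2_PER_KWH

# Example: a 4-GPU, 28-hour search, matching the DARTS excerpt further below.
print(f"{search_phase_co2_kg('RTX 2080 Ti', 4, 28):.1f} kg CO2")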
“…We argue that a generic alternate optimization of network weights and architecture weights, as suggested in previous works, e.g., [17,25], is not suitable for the unique structure of the architecture space. Hence, we design a tailor-made optimizer for this task, inspired by PEA theory.…”
Section: XNAS: Experts Neural Architecture Search (mentioning)
confidence: 86%
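The "generic alternate optimization" referred to here is the DARTS-style scheme: alternate gradient steps on the network weights w (training loss) and the architecture parameters alpha (validation loss). A minimal first-order sketch, assuming a model that exposes its weights and architecture parameters separately; the method names and hyperparameters are placeholders, not the citing paper's implementation:

import torch

def alternate_search(model, train_loader, val_loader, loss_fn, epochs=50):
    # Separate optimizers for network weights w and architecture logits alpha.
    w_opt = torch.optim.SGD(model.weights(), lr=0.025, momentum=0.9)
    a_opt = torch.optim.Adam(model.arch_parameters(), lr=3e-4, weight_decay=1e-3)
    for _ in range(epochs):
        for (x_tr, y_tr), (x_val, y_val) in zip(train_loader, val_loader):
            # Step 1: update architecture parameters alpha on validation data.
            a_opt.zero_grad()
            loss_fn(model(x_val), y_val).backward()
            a_opt.step()
            # Step 2: update network weights w on training data.
            w_opt.zero_grad()
            loss_fn(model(x_tr), y_tr).backward()
            w_opt.step()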
“…Early NAS methods adopt reinforcement learning (RL) or evolutionary strategies [38,2,3,31,30,39] to search among thousands of individually trained networks, which costs enormous computational resources. Recent works focus on efficient weight-sharing methods, which fall into two categories, one-shot approaches [6,4,1,7,18,33,29] and gradient-based approaches [32,27,9,8,20,12,34,23], and achieve state-of-the-art results on a series of tasks [10,17,24,35,16,28] in various search spaces. They construct a super network/graph which shares weights with all sub-networks/graphs.…”
Section: Related Work (mentioning)
confidence: 99%
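The shared super network mentioned in this excerpt works by giving every candidate operation on an edge a single set of weights and mixing their outputs with a softmax over architecture logits, so all sub-networks reuse the same operation weights. A minimal DARTS-style sketch; the three candidate operations chosen here are illustrative:

import torch
import torch.nn as nn

class MixedOp(nn.Module):
    """One edge of a weight-sharing super network (DARTS-style sketch)."""
    def __init__(self, channels: int):
        super().__init__()
        self.ops = nn.ModuleList([
            nn.Conv2d(channels, channels, 3, padding=1),  # conv 3x3
            nn.MaxPool2d(3, stride=1, padding=1),         # max pool 3x3
            nn.Identity(),                                # skip connection
        ])
        self.alpha = nn.Parameter(torch.zeros(len(self.ops)))  # architecture logits

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        weights = torch.softmax(self.alpha, dim=0)
        # Every sub-network reuses the same op weights; alpha selects the mix.
        return sum(w * op(x) for w, op in zip(weights, self.ops))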
“…The search phase takes about 28 hours on 4 NVIDIA GeForce RTX 2080 Ti GPUs. In the retraining phase, we adopt the training strategy of previous works [20] to train the searched architecture from scratch, without any additional module. The whole process lasts 250 epochs, using the SGD optimizer with a momentum of 0.9 and a weight decay of 3×10⁻⁵.…”
Section: DARTS (mentioning)
confidence: 99%
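For concreteness, the quoted retraining recipe maps onto a standard SGD loop. A hypothetical sketch: the model, data loader, loss, learning rate, and cosine schedule are our assumptions, since the excerpt only fixes the epoch count, momentum, and weight decay:

import torch

def retrain(model, train_loader, loss_fn, epochs=250, lr=0.025):
    # Momentum and weight decay match the excerpt; lr and schedule are assumed.
    opt = torch.optim.SGD(model.parameters(), lr=lr,
                          momentum=0.9, weight_decay=3e-5)
    sched = torch.optim.lr_scheduler.CosineAnnealingLR(opt, T_max=epochs)
    for _ in range(epochs):
        for x, y in train_loader:
            opt.zero_grad()
            loss_fn(model(x), y).backward()
            opt.step()
        sched.step()  # anneal the learning rate once per epoch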