2018
DOI: 10.48550/arxiv.1812.09926
Preprint

SNAS: Stochastic Neural Architecture Search

Abstract: We propose Stochastic Neural Architecture Search (SNAS), an economical end-to-end solution to Neural Architecture Search (NAS) that trains neural operation parameters and architecture distribution parameters in the same round of backpropagation, while maintaining the completeness and differentiability of the NAS pipeline. In this work, NAS is reformulated as an optimization problem on parameters of a joint distribution for the search space in a cell. To leverage the gradient information in generic differentiable loss…
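As a rough illustration of the idea summarized in the abstract, the sketch below shows one edge of a cell where a Gumbel-softmax sample over candidate operations keeps the architecture choice differentiable, so operation weights and architecture distribution logits receive gradients from the same backward pass. This is a minimal PyTorch-style sketch under assumed sizes and a reduced operation set, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MixedEdge(nn.Module):
    """One cell edge: a differentiable sample over candidate operations."""
    def __init__(self, channels: int, tau: float = 1.0):
        super().__init__()
        # Hypothetical, reduced candidate-operation set for illustration.
        self.ops = nn.ModuleList([
            nn.Identity(),
            nn.Conv2d(channels, channels, 3, padding=1, bias=False),
            nn.AvgPool2d(3, stride=1, padding=1),
        ])
        # Architecture distribution parameters (logits) for this edge.
        self.alpha = nn.Parameter(torch.zeros(len(self.ops)))
        self.tau = tau

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Relaxed one-hot sample over operations; differentiable w.r.t. alpha.
        z = F.gumbel_softmax(self.alpha, tau=self.tau, hard=False)
        return sum(z[i] * op(x) for i, op in enumerate(self.ops))

# Operation weights and alpha are updated from a single task loss:
edge = MixedEdge(channels=8)
x = torch.randn(2, 8, 16, 16)
loss = edge(x).mean()
loss.backward()  # gradients flow to both edge.alpha and the conv weights
```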

Cited by 199 publications (203 citation statements)
References 21 publications
“…However, such methods require training the searched architecture from scratch for each search step, which is extremely computationally expensive. To address this, weight-sharing approaches have been proposed [4,7,8,10,29,55,66,80,86,93,103]. They train the supernet once which includes all architecture candidates.…”
Section: Neural Architecture Search
confidence: 99%
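The weight-sharing idea quoted above can be illustrated with a minimal sketch: every layer of a supernet holds all candidate operations, and any sampled sub-network reuses those shared weights, so candidates are compared without retraining from scratch. This is an assumed, simplified example, not code from the cited papers.

```python
import random
import torch
import torch.nn as nn

class SuperLayer(nn.Module):
    """Holds all candidate operations for one position in the network."""
    def __init__(self, channels: int):
        super().__init__()
        self.candidates = nn.ModuleList([
            nn.Conv2d(channels, channels, 3, padding=1, bias=False),
            nn.Conv2d(channels, channels, 5, padding=2, bias=False),
            nn.Identity(),
        ])

    def forward(self, x: torch.Tensor, choice: int) -> torch.Tensor:
        return self.candidates[choice](x)

class SuperNet(nn.Module):
    """A supernet whose shared weights cover all candidate architectures."""
    def __init__(self, channels: int = 8, depth: int = 4):
        super().__init__()
        self.layers = nn.ModuleList(SuperLayer(channels) for _ in range(depth))

    def forward(self, x: torch.Tensor, choices) -> torch.Tensor:
        for layer, c in zip(self.layers, choices):
            x = layer(x, c)
        return x

# Sampling a sub-architecture just picks one candidate per layer; its weights
# are shared with every other sub-architecture, so the supernet is trained once.
net = SuperNet()
x = torch.randn(2, 8, 16, 16)
choices = [random.randrange(3) for _ in net.layers]
out = net(x, choices)
```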
“…Compared to standard ANNs, SNNs require significantly higher computational cost for training due to multiple feedforward steps [53]. This makes it difficult to search for an optimal SNN architecture with NAS techniques that train the architecture candidate multiple times [2,78,[107][108][109] or train a complex supernet [8,29,55,86]. To minimize the training budget, our work is motivated by the previous works [12,58,88] which demonstrate that the optimal architecture can be founded without any training process.…”
Section: NAS Without Training
confidence: 99%
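To make the "NAS without training" idea above concrete, the sketch below ranks candidate networks by a generic zero-cost proxy computed at initialization (here, the total parameter-gradient norm after one forward/backward pass on a single mini-batch). This is only an assumed illustration of the general approach, not the specific metrics used in the works cited in that statement.

```python
import torch
import torch.nn as nn

def init_score(model: nn.Module, x: torch.Tensor) -> float:
    """Sum of parameter-gradient magnitudes after one backward pass at init."""
    model.zero_grad()
    model(x).sum().backward()
    return sum(p.grad.abs().sum().item()
               for p in model.parameters() if p.grad is not None)

# Rank candidate architectures without training any of them.
candidates = [
    nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.ReLU(),
                  nn.Conv2d(8, 8, 3, padding=1))
    for _ in range(4)
]
x = torch.randn(2, 3, 16, 16)
best = max(candidates, key=lambda m: init_score(m, x))
```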
“…Compared with traditional manually designed models, NAS requires less labor work and achieves better performance. Existing NAS algorithms can be categorized into three groups: 1) Reinforcement learning-based approaches [28,44,45], 2) Evolution-based approaches [22,29], and 3) Gradient-based approaches [3,6,36]. For reinforcement learning approaches, candidate architectures are sampled from search space based on reinforcement learning algorithms.…”
Section: Neural Architecture Search
confidence: 99%