2020
DOI: 10.1007/978-3-030-58571-6_11
|View full text |Cite
|
Sign up to set email alerts
|

Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search

Abstract: In this paper, we introduce a new reinforcement learning (RL) based neural architecture search (NAS) methodology for effective and efficient generative adversarial network (GAN) architecture search. The key idea is to formulate the GAN architecture search problem as a Markov decision process (MDP) for smoother architecture sampling, which enables a more effective RL-based search algorithm by targeting the potential global optimal architecture. To improve efficiency, we exploit an off-policy GAN architecture se… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

2
28
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 48 publications
(33 citation statements)
references
References 24 publications
2
28
0
Order By: Relevance
“…Dai et al [99] Data adapted pruning for efficient neural architecture search DA-NAS 2020 ECCV Gradient based Classification Tian et al [100] Efficient and effective GAN architecture search E 2 GAN 2020 ECCV Reinforcement learning GAN Chu et al [101] Fair differentiable architecture search FairDARTS 2020 ECCV Gradient based Classification Hu et al [102] Three-freedom neural architecture search TF-NAS 2020 ECCV Gradient based Classification Hu et al [103] Angle-based search space shrinking ABS 2020 ECCV Other Classification Yu et al [104] Barrier penalty neural architecture search BP-NAS 2020 ECCV Other Classification Wang et al [105] Attention cell search for video classification AttentionNAS 2020 ECCV Other Video classification Bulat et al [106] Binary architecTure search BATS 2020 ECCV Other Classification Yu et al [107] Neural architecture search with big single-stage models BigNAS 2020 ECCV Gradient based Classification Guo et al [108] Single path one-shot neural architecture search with uniform sampling Single-Path-SuperNet 2020 ECCV Evolutionary algorithm Classification Liu et al [109] Unsupervised neural architecture search UnNAS 2020 ECCV Gradient based Classification get tasks, which can solve large GPU memory consumption problems and long computation time of the NAS method. Liu et al [67] proposed the method of DARTS for effective structure search.…”
Section: Gradient Based Classificationmentioning
confidence: 99%
“…Dai et al [99] Data adapted pruning for efficient neural architecture search DA-NAS 2020 ECCV Gradient based Classification Tian et al [100] Efficient and effective GAN architecture search E 2 GAN 2020 ECCV Reinforcement learning GAN Chu et al [101] Fair differentiable architecture search FairDARTS 2020 ECCV Gradient based Classification Hu et al [102] Three-freedom neural architecture search TF-NAS 2020 ECCV Gradient based Classification Hu et al [103] Angle-based search space shrinking ABS 2020 ECCV Other Classification Yu et al [104] Barrier penalty neural architecture search BP-NAS 2020 ECCV Other Classification Wang et al [105] Attention cell search for video classification AttentionNAS 2020 ECCV Other Video classification Bulat et al [106] Binary architecTure search BATS 2020 ECCV Other Classification Yu et al [107] Neural architecture search with big single-stage models BigNAS 2020 ECCV Gradient based Classification Guo et al [108] Single path one-shot neural architecture search with uniform sampling Single-Path-SuperNet 2020 ECCV Evolutionary algorithm Classification Liu et al [109] Unsupervised neural architecture search UnNAS 2020 ECCV Gradient based Classification get tasks, which can solve large GPU memory consumption problems and long computation time of the NAS method. Liu et al [67] proposed the method of DARTS for effective structure search.…”
Section: Gradient Based Classificationmentioning
confidence: 99%
“…In adversarial training, the discriminator and generator compete, forcing the generator to produce high-quality output that can fool the discriminator. Adversarial training is usually successful in image generation (Karras et al 2019;Tian et al 2020;Gong et al 2019), limited contribution to natural language processing tasks (Wiseman and Rush 2016;Yang et al 2018;Yu et al 2017), mainly due to the difficulty in propagating error signals from discriminator to generator through discrete generated natural language tokens. Yu et al (2017) alleviates such difficulties employing reinforcement learning methods for sequence generation.…”
Section: Related Workmentioning
confidence: 99%
“…In this paper, we propose an Open IE system with Generative Adversarial Networks (Goodfellow et al 2014) architecture. GANs is a promising framework for alleviating exposure bias problem and recently shows remarkable promise in many tasks, such as machine translation (Wiseman and Rush 2016; Yang et al 2018;Yu et al 2017), especially in image generation (Karras et al 2019;Tian et al 2020;Gong et al 2019). Besides the typical sequence-tosequence model (implemented by Transformer and output the sequence with separators) to address the Open IE problem.…”
Section: Introductionmentioning
confidence: 99%
“…Combined with powerful function approximators, such as deep neural networks, RL methods can work with complex large-state spaces. RL methods have been applied to various control problems in robotics [16][17][18], water systems management [19], computational biology [20], and AutoML [21]. RL methods have multiple advantages over traditional methods: (1) RL agents can learn to solve tasks without any knowledge of the underlying model.…”
Section: Introductionmentioning
confidence: 99%