2020
DOI: 10.1109/tcyb.2020.2983860
|View full text |Cite
|
Sign up to set email alerts
|

Automatically Designing CNN Architectures Using the Genetic Algorithm for Image Classification

Abstract: Convolutional Neural Networks (CNNs) have gained a remarkable success on many real-world problems in recent years. However, the performance of CNNs is highly relied on their architectures. For some state-of-the-art CNNs, their architectures are hand-crafted with expertise in both CNNs and the investigated problems. To this end, it is difficult for researchers, who have no extended expertise in CNNs, to explore CNNs for their own problems of interest. In this paper, we propose an automatic architecture design m… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
300
0
12

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 622 publications
(355 citation statements)
references
References 45 publications
(96 reference statements)
0
300
0
12
Order By: Relevance
“…However, most real-world users often do not have such knowledge. In recent years, evolutionary computation (EC) has shown to be effective in automatically searching for the optimal architecture of CNNs [5] [6] [7].…”
Section: Introductionmentioning
confidence: 99%
“…However, most real-world users often do not have such knowledge. In recent years, evolutionary computation (EC) has shown to be effective in automatically searching for the optimal architecture of CNNs [5] [6] [7].…”
Section: Introductionmentioning
confidence: 99%
“…In particular, deep neural networks (DNNs) [1] have performed excellently in machine learning tasks such as recognition [2,3] and pattern analysis [4]. Particularly, the genetic algorithm [5] and the ant colony algorithm [6] using bio-inspired methods [7] have shown improved image recognition results. The genetic algorithm can find the most optimally performing convolutional neural network (CNN) structure among given CNN models for an image task without pre-or post-processing.…”
Section: Introductionmentioning
confidence: 99%
“…El entrenamiento de una CNN requiere mucho tiempo y es computacionalmente costoso pues utiliza grandes conjuntos de imágenes de entrenamiento (típicamente se requieren varias decenas de miles de imágenes). Esto dificulta el empleo de otras técnicas que buscan mejorar el desempeño de CNNs o lograr el diseño automático de redes; por ejemplo, los trabajos [6] y [7] proponen llevar a cabo la optimización de la arquitectura de CNNs para las tareas de clasificación de imágenes (ver Fig. 1), mediante el uso de algoritmos evolutivos.…”
Section: Introductionunclassified
“…Aunque dicha estrategia de optimización de la arquitectura de CNN mediante un algoritmo evolutivo ha resultado exitosa, su complejidad computacional es muy elevada y puede ser prohibitiva para muchos investigadores y practicantes que no cuentan con una infraestructura de cómputo suficientemente potente. Por ejemplo, en [7] se menciona que el tiempo requerido para la evolución de una CNN utilizando una variante de algoritmo genético y aplicada al conjunto de datos CIFAR10 1 fue de 35 días de cómputo en una GPU; mientras, en [8] se reporta que el método allí descrito consume 2,750 días de cómputo en una GPU para evolucionar una red para el mismo problema de CIFAR10 (para más datos de sistemas del estado del arte, ver [7]).…”
Section: Introductionunclassified
See 1 more Smart Citation