2017
DOI: 10.1007/s11042-017-5443-x
|View full text |Cite
|
Sign up to set email alerts
|

CNN-RNN: a large-scale hierarchical image classification framework

Abstract: Objects are often organized in a semantic hierarchy of categories, where finelevel categories are grouped into coarse-level categories according to their semantic relations. While previous works usually only classify objects into the leaf categories, we argue that generating hierarchical labels can actually describe how the leaf categories evolved from higher level coarse-grained categories, thus can provide a better understanding of the objects. In this paper, we propose to utilize the CNN-RNN framework to ad… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
60
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 112 publications
(60 citation statements)
references
References 37 publications
(62 reference statements)
0
60
0
Order By: Relevance
“…However, this results in a very complex training and inference procedure. Guo et al (2018) propose a CNN-RNN (recurrent neural network) strategy to address hierarchical classification. A CNN acts as a feature extractor and is trained to predict class labels at the coarse semantic level.…”
Section: Related Workmentioning
confidence: 99%
“…However, this results in a very complex training and inference procedure. Guo et al (2018) propose a CNN-RNN (recurrent neural network) strategy to address hierarchical classification. A CNN acts as a feature extractor and is trained to predict class labels at the coarse semantic level.…”
Section: Related Workmentioning
confidence: 99%
“…In the early stages of this field, some people tried to combine CNNs and RNNs to solve the general image classification problem. The most straightforward method is to use a CNN to extract histology image features and then input the extracted features into an RNN [5][6][7]. The method proposed by Zheng et al [8] and the method proposed by Nahid et al [9] flatten the feature of the last layer of the convolutional neural network to one dimension and then input the result into the recurrent neural network.…”
Section: Introductionmentioning
confidence: 99%
“…An and Liu [13] proposed a CNN–LSTM–support vector machine (SVM) hybrid method for facial expression recognition. In [14], Guo et al performed image classification with a CNN–RNN hybrid model. In [15], Lakhal et al employed CNN architecture to encode remote sensing images into a feature space, and then used LSTM architecture to decode these features to predict the class labels.…”
Section: Introductionmentioning
confidence: 99%