2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016
DOI: 10.1109/cvpr.2016.251
|View full text |Cite
|
Sign up to set email alerts
|

CNN-RNN: A Unified Framework for Multi-label Image Classification

Abstract: While deep convolutional neural networks (CNNs) have shown a great success in single-label image classification, it is important to note that real world images generally contain multiple labels, which could correspond to different objects, scenes, actions and attributes in an image. Traditional approaches to multi-label image classification learn independent classifiers for each category and employ ranking or thresholding on the classification results. These techniques, although working well, fail to explicitl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

3
712
0
4

Year Published

2017
2017
2022
2022

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 1,049 publications
(769 citation statements)
references
References 37 publications
3
712
0
4
Order By: Relevance
“…Some research employs a unified CNN-RNN framework learns a joint image-label embedding to characterize the semantic label dependency as well as the image-label relevance, and it can be trained end-to-end from scratch to integrate both information in one framework, the model achieves better performance than the state-of-the-art multi-label classification models [35]. The model for intelligent analysis in this paper is mainly composed of two parts, a CNN for learning styles detection and a RNN combined with gated recurrent unit for learning styles prediction.…”
Section: Intelligent Analysis Processmentioning
confidence: 99%
“…Some research employs a unified CNN-RNN framework learns a joint image-label embedding to characterize the semantic label dependency as well as the image-label relevance, and it can be trained end-to-end from scratch to integrate both information in one framework, the model achieves better performance than the state-of-the-art multi-label classification models [35]. The model for intelligent analysis in this paper is mainly composed of two parts, a CNN for learning styles detection and a RNN combined with gated recurrent unit for learning styles prediction.…”
Section: Intelligent Analysis Processmentioning
confidence: 99%
“…It is worth noting that the recent progress in deep learning, especially the combination of convolution neural network (CNN) and recurrent neural network (RNN), has been shown to outperform most of the traditional approaches (which utilize human‐engineered feature representation and machine learning mapping) for a number of applications, such as speech recognition, text annotation, and image tagging (Bertero & Fung, ; Y. Kim, ; Li & Wu, ; Shen & Lee, ; Wang et al, ). However, the price for the increased predictive accuracy is the decreased interpretability of the feature representation used in deep learning methods.…”
mentioning
confidence: 99%
“…Bo Wang et al [19], proposed a dynamic label propagation (DLP), which improves multi-label classification using semi-supervised learning. Jiang Wang et al [10], proposed a combine CNN-RNN framework for multi-label classification for images. Advantages of CNN and RNN are use for joint image or label co-occurrence embedding.…”
Section: Related Workmentioning
confidence: 99%