2020
DOI: 10.1007/978-3-030-58589-1_39
|View full text |Cite
|
Sign up to set email alerts
|

Attention-Driven Dynamic Graph Convolutional Network for Multi-label Image Recognition

Abstract: Recent studies often exploit Graph Convolutional Network (GCN) to model label dependencies to improve recognition accuracy for multi-label image recognition. However, constructing a graph by counting the label co-occurrence possibilities of the training data may degrade model generalizability, especially when there exist occasional cooccurrence objects in test images. Our goal is to eliminate such bias and enhance the robustness of the learnt features. To this end, we propose an Attention-Driven Dynamic Graph … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
85
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 122 publications
(87 citation statements)
references
References 35 publications
1
85
0
Order By: Relevance
“…We can see that our method achieves the best mAP performance among all methods. We also observe the small margin between our results and ADD-GCN [52]. In addition to the difference in input image resolution (ours 448 × 448 and ADD-GCN's 512 × 512), the small increase may indicate the limited data of VOC 07 and its saturated metric.…”
Section: Performance On Pascal Vocmentioning
confidence: 53%
See 1 more Smart Citation
“…We can see that our method achieves the best mAP performance among all methods. We also observe the small margin between our results and ADD-GCN [52]. In addition to the difference in input image resolution (ours 448 × 448 and ADD-GCN's 512 × 512), the small increase may indicate the limited data of VOC 07 and its saturated metric.…”
Section: Performance On Pascal Vocmentioning
confidence: 53%
“…Chen et al [8] constructed a similar graph but based on class-aware maps, which is calculated by image level feature and classification weights, and constrained the graph by label co-occurrence. Rather than using static graph, Ye et al [52] updated static graph to dynamic graph by using a dynamic graph convolutional network(GCN) module for robust representations. While modeling label correlations can introduce additional gains in multi-label classification, it is also arguable that it may learn spurious correlations when the label statistics are insufficient.…”
Section: Multi-label Classificationmentioning
confidence: 99%
“…Hu et al [19] leverages a heterogeneous GNN to handle multiple types of nodes such as topics and entities for text classification. Similarly, we use heterogeneous GNN to obtain the preference scores of each token, but our graph is dynamic as its node correlation matrix is adjustable (inspired by [59]). The final adjusted correlations will be aggregated to obtain preference scores.…”
Section: Graph Neural Network For Text Miningmentioning
confidence: 99%
“…Zhang et al [42] proposed a hierarchical and transparent representation learning method to express the semantic information for accurate paper-reviewer recommendation as multi-label classification. Ye et al [43] introduced a dynamic graph convolutional network to project raw input into categoryaware representations with semantic attention module, and the final category representations are utilized for multi-label image recognition. Gong et al [44] proposed a hierarchical graph transformer method for multi-label text classification, a multi-layer transformer structure and the hierarchical relationship of the labels are used for feature representations learning in different level.…”
Section: Related Workmentioning
confidence: 99%