2021
DOI: 10.1049/ipr2.12280
|View full text |Cite
|
Sign up to set email alerts
|

Leveraging attention‐based visual clue extraction for image classification

Abstract: Deep learning-based approaches have made considerable progress in image classification tasks, but most of the approaches lack interpretability, especially in revealing the decisive information causing the categorization of images. This paper seeks to answer the question of what clues encode the discriminative visual information between image categories and can help improve the classification performance. To this end, an attention-based clue extraction network (ACENet) is introduced to mine the decisive local v… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 34 publications
0
1
0
Order By: Relevance
“…Therefore, how to accurately extract and properly fuse the semantic features of each media data for jointly semantics analysis, is the key to understand multimedia contents. Recently, Deep Neural Networks (DNNs) have shown great power in feature representation learning and achieved superior performance on various tasks, including image recognition [3][4][5], object detection [6][7][8] etc. However, they still have the following drawbacks on the multimedia content understanding, which if addressed would improve their performance.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, how to accurately extract and properly fuse the semantic features of each media data for jointly semantics analysis, is the key to understand multimedia contents. Recently, Deep Neural Networks (DNNs) have shown great power in feature representation learning and achieved superior performance on various tasks, including image recognition [3][4][5], object detection [6][7][8] etc. However, they still have the following drawbacks on the multimedia content understanding, which if addressed would improve their performance.…”
Section: Introductionmentioning
confidence: 99%