2021 IEEE International Conference on Multimedia and Expo (ICME) 2021
DOI: 10.1109/icme51207.2021.9428148
|View full text |Cite
|
Sign up to set email alerts
|

Knowledge Transfer Based Fine-Grained Visual Classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 19 publications
0
4
0
Order By: Relevance
“…A lower similarity 𝑀 represents a higher complementarity between these two features. Next, we can use the similarity 𝑀 to obtain the complementarity factor 𝐴 ∈ [0,1], 𝐴 = π‘†π‘œπ‘“π‘‘π‘€π‘Žπ‘₯(βˆ’π‘€) (13) where 𝐴 ∈ 𝑅 Γ— represents the degree of complementarity of feature 𝑋 to feature 𝑋 . Then using the complementarity factor we can get the complementary information π‘Œ .…”
Section: 𝑀 = 𝑓(𝑋 𝑋 ) 𝑓 = 𝑋 π‘Œmentioning
confidence: 99%
See 1 more Smart Citation
“…A lower similarity 𝑀 represents a higher complementarity between these two features. Next, we can use the similarity 𝑀 to obtain the complementarity factor 𝐴 ∈ [0,1], 𝐴 = π‘†π‘œπ‘“π‘‘π‘€π‘Žπ‘₯(βˆ’π‘€) (13) where 𝐴 ∈ 𝑅 Γ— represents the degree of complementarity of feature 𝑋 to feature 𝑋 . Then using the complementarity factor we can get the complementary information π‘Œ .…”
Section: 𝑀 = 𝑓(𝑋 𝑋 ) 𝑓 = 𝑋 π‘Œmentioning
confidence: 99%
“…In addition to this, these three types of methods have the problem of over-enhancing some features, which leads to over-fitting of the system and poor generalization ability. In order to overcome the problem of poor generalization ability caused by over-enhancing some of the features, this paper proposes multi-feature enhancement module (MFEM) inspired by the use of erasure mechanism to force the network to focus on the potential information of the classification network [12], [13]. MFEM uses the attention mechanism to obtain an enhanced feature map at each stage of the network, and then weakens the most salient features at the next stage before obtaining an enhanced feature map.…”
Section: Introductionmentioning
confidence: 99%
“…The WSLL can locate lesions in a weakly-supervised manner for further lesion decomposition with the CMZ strategy. C. Contrastive Multi-Zoom strategy 1) Motivation of the CMZ strategy: Extracting disentangled features can enhance recognition as not all discriminative regions (e.g., lesions) appear [18], [42], [43]. Intuitively, the various lesion scope induces significant intra-class variance.…”
Section: B Weakly-supervised Lesion Localization Modulementioning
confidence: 99%
“…More specifically, low-overlapping and disentangled inputs are more suitable for partial lesion recognition than the entangled ones. Nonetheless, the widely-used attention-guided Center Zoom mechanism only zooms in on the whole suspicious area, which increases the difficulties of feature decomposition and the probability of ignoring subtle regions [18], [43]. We delicately designed our CMZ strategy for reducing the intersection of the local-branch inputs inspired by Contrastive Crop [44], which is proposed first in self-supervised learning (SSL) for data augmentation.…”
Section: B Weakly-supervised Lesion Localization Modulementioning
confidence: 99%