2024
DOI: 10.1016/j.displa.2023.102616
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting multi-scale contextual prompt learning for zero-shot semantic segmentation

Yiqi Wang,
Yingjie Tian
Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 43 publications
0
2
0
Order By: Relevance
“…where Equation (7) represents the mapping of semantic features to visual space, v represents attribute semantic features, f represents sample visual features, and 1 W is a learnable mapping matrix to compute the correlation between visual features and attribute semantic features. Similarly, Equation ( 8) represents the mapping of visual features to attribute semantic space, and 2 W is a learnable mapping matrix to compute the correlation between attribute semantic features and visual features.…”
Section: Feature Mapping Modulementioning
confidence: 99%
See 1 more Smart Citation
“…where Equation (7) represents the mapping of semantic features to visual space, v represents attribute semantic features, f represents sample visual features, and 1 W is a learnable mapping matrix to compute the correlation between visual features and attribute semantic features. Similarly, Equation ( 8) represents the mapping of visual features to attribute semantic space, and 2 W is a learnable mapping matrix to compute the correlation between attribute semantic features and visual features.…”
Section: Feature Mapping Modulementioning
confidence: 99%
“…The difference between zero-shot image classification and traditional image classification is whether the training set contains the category samples from the test set, as shown in Figure 1. Since its development, the idea of zero-shot learning has attracted much attention and has been introduced into various tasks, such as zero-shot image retrieval [2,3] and zero-shot video classification [4] tasks in the field of computer vision, zero-shot sketch retrieval [5] tasks in the cross-modal field, zero-shot semantic segmentation [6,7] and zeroshot text classification [8] tasks in the field of natural language processing, and zero-shot visual quizzing [9] tasks in the multimodal field. The research object of this paper is the zero-shot image classification task, i.e., the zero-shot learning idea is applied to the task of image classification.…”
Section: Introductionmentioning
confidence: 99%