2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00368
|View full text |Cite
|
Sign up to set email alerts
|

Rethinking Zero-Shot Learning: A Conditional Visual Classification Perspective

Abstract: Zero-shot learning (ZSL) aims to recognize instances of unseen classes solely based on the semantic descriptions of the classes. Existing algorithms usually formulate it as a semantic-visual correspondence problem, by learning mappings from one feature space to the other. Despite being reasonable, previous approaches essentially discard the highly precious discriminative power of visual features in an implicit way, and thus produce undesirable results. We instead reformulate ZSL as a conditioned visual classif… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
66
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
4
2

Relationship

1
9

Authors

Journals

citations
Cited by 110 publications
(66 citation statements)
references
References 40 publications
(57 reference statements)
0
66
0
Order By: Relevance
“…Originating from non-realistic rendering [18], image style transfer is closely related to texture synthesis [5,7,6]. Gatys et al [8] were the first to formulate style transfer as the matching of multi-level deep features extracted from a pre-trained deep neural network, which has been widely used in various tasks [20,21,22]. Lots of improvements have been proposed based on the works of Gatys et al [8].…”
Section: Related Workmentioning
confidence: 99%
“…Originating from non-realistic rendering [18], image style transfer is closely related to texture synthesis [5,7,6]. Gatys et al [8] were the first to formulate style transfer as the matching of multi-level deep features extracted from a pre-trained deep neural network, which has been widely used in various tasks [20,21,22]. Lots of improvements have been proposed based on the works of Gatys et al [8].…”
Section: Related Workmentioning
confidence: 99%
“…In other words, it addresses multi-class learning problems when some classes do not have sufficient training data. However, during the learning process, additional visual and semantic features such as word embeddings [132], visual attributes [133], or descriptions [134] can be assigned to both seen and unseen classes. In the context of multimodality, a multimodal mapping scheme typically combines visual and semantic attributes using only data related to the seen classes.…”
Section: Zero-shot Learningmentioning
confidence: 99%
“…The application of zero-shot in super-resolution, i.e, the ZSSR prescription [23], is among the most widely used models in superresolution and has gained increasing interest recently. In addition, the majority of zero-shot methods that have provided a large number of excellent results in recent years are mainly based on segmentation [39], emotion recognition [40], object detection [41], image retrieval [42][43][44][45], image classification [46][47][48] and intelligent learning in machines or robots [49]. In the ZSSR formalism, LR images are downsampled to generate many lower-resolution images (I = I 0 , I 1 , I 2 , ..., I n ), which serve as the HR supervision information called "HR fathers, " then, each HR father is downscaled by the required scale factor s to obtain the corresponding "LR sons.…”
Section: Zero-shotmentioning
confidence: 99%