2014
DOI: 10.1016/j.cviu.2014.03.003
|View full text |Cite
|
Sign up to set email alerts
|

Cross-modal domain adaptation for text-based regularization of image semantics in image retrieval systems

Abstract: a b s t r a c tIn query-by-semantic-example image retrieval, images are ranked by similarity of semantic descriptors. These descriptors are obtained by classifying each image with respect to a pre-defined vocabulary of semantic concepts. In this work, we consider the problem of improving the accuracy of semantic descriptors through cross-modal regularization, based on auxiliary text. A cross-modal regularizer, composed of three steps, is proposed. Training images and text are first mapped to a common semantic … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
5
3
2

Relationship

1
9

Authors

Journals

citations
Cited by 26 publications
(7 citation statements)
references
References 76 publications
0
7
0
Order By: Relevance
“…It should be noted that, the above work is restricted to the case of the combination of two different media types. Pereira's work [25] follows crossmedia idea, and improves the content-based image retrieval by using the contextual information of images. In this work, the training images and texts are first mapped to a common semantic space and then a regularization operator is learned for each concept in the semantic vocabulary.…”
Section: Related Workmentioning
confidence: 98%
“…It should be noted that, the above work is restricted to the case of the combination of two different media types. Pereira's work [25] follows crossmedia idea, and improves the content-based image retrieval by using the contextual information of images. In this work, the training images and texts are first mapped to a common semantic space and then a regularization operator is learned for each concept in the semantic vocabulary.…”
Section: Related Workmentioning
confidence: 98%
“…We can consider swap retrieval as a special case of domain adaptation, where the domain shift is actually a change in the object category that we are looking for. Pereira and Vasconcelos [22] for instance apply a cross-modal domain adaptation to the task of image retrieval by considering image and text as source and target domains. In our task, we consider the features corresponding to the object category in the query image and the swapped category as source and target domains.…”
Section: Domain Adaptationmentioning
confidence: 99%
“…a phone connection [20,24]. In computer vision, adaptation has been proposed to bridge gap between camera views [8], data modalities [6,16,17], image conditions [22], or even object classes [10]. Recently, model adaptation has been used in the deep learning literature, to adapt a model learned from the Imagenet corpus [9] to other tasks [7].…”
Section: Related Workmentioning
confidence: 99%