2019
DOI: 10.1007/978-3-030-20890-5_16
|View full text |Cite
|
Sign up to set email alerts
|

Aligning Salient Objects to Queries: A Multi-modal and Multi-object Image Retrieval Framework

Abstract: In this paper we propose an approach for multi-modal image retrieval in multi-labelled images. A multi-modal deep network architecture is formulated to jointly model sketches and text as input query modalities into a common embedding space, which is then further aligned with the image feature space. Our architecture also relies on a salient object detection through a supervised LSTM-based visual attention model learned from convolutional features. Both the alignment between the queries and the image and the su… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
1
1
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 34 publications
(36 reference statements)
0
1
0
Order By: Relevance
“…Sketches can be used to retrieve natural photos [11,21,29,54,57], manga [34], 3D shape [52], video [45] and many more. Specifically, sketch based image retrieval (SBIR) [8] has seen lot of research done to bridge this domain gap between sparse line drawings with dense pixel representations using cross modal deep learning techniques [7]. In these lines, different work has also been conducted on fine-grained SBIR [53], zero-shot SBIR [9], large-scale SBIR [48], and many others.…”
Section: Related Workmentioning
confidence: 99%
“…Sketches can be used to retrieve natural photos [11,21,29,54,57], manga [34], 3D shape [52], video [45] and many more. Specifically, sketch based image retrieval (SBIR) [8] has seen lot of research done to bridge this domain gap between sparse line drawings with dense pixel representations using cross modal deep learning techniques [7]. In these lines, different work has also been conducted on fine-grained SBIR [53], zero-shot SBIR [9], large-scale SBIR [48], and many others.…”
Section: Related Workmentioning
confidence: 99%