2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00805
|View full text |Cite
|
Sign up to set email alerts
|

Bidirectional Retrieval Made Simple

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
23
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
4

Relationship

2
7

Authors

Journals

citations
Cited by 34 publications
(23 citation statements)
references
References 21 publications
0
23
0
Order By: Relevance
“…We have also proposed a novel inception-inspired text encoder named CHAIN-VSE for efficient multimodal retrieval [Wehrmann and Barros 2018]. That work was accepted in the CVPR 2018 main conference, which is the conference with highest H-index in computer science as of today.…”
Section: Summary Of Contributionsmentioning
confidence: 99%
“…We have also proposed a novel inception-inspired text encoder named CHAIN-VSE for efficient multimodal retrieval [Wehrmann and Barros 2018]. That work was accepted in the CVPR 2018 main conference, which is the conference with highest H-index in computer science as of today.…”
Section: Summary Of Contributionsmentioning
confidence: 99%
“…Wehrmann et al [45] improve sentence representations with a character level inception module and [20,26] improve image representations for image-text matching models. Huang et al [20] use multi-label classification to extract various concepts in images, requiring additional image annotations.…”
Section: Related Workmentioning
confidence: 99%
“…As the embedding space is learned through jointly modeling vision and language, it is often referred as Visual Semantic Embeddings (VSE). Recent work on VSE has shown a clear trend of growing dimensions in order to obtain better embedding quality (Wehrmann 2018). With deeper embeddings, visual semantic hubs increase dramatically.…”
Section: Introductionmentioning
confidence: 99%