2021
DOI: 10.48550/arxiv.2101.04626
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Predicting Relative Depth between Objects from Semantic Features

Stefan Cassar,
Adrian Muscat,
Dylan Seychell

Abstract: Vision and language tasks such as Visual Relation Detection and Visual Question Answering benefit from semantic features that afford proper grounding of language. The 3D depth of objects depicted in 2D images is one such feature. However it is very difficult to obtain accurate depth information without learning the appropriate features, which are scene dependent. The state of the art in this area are complex Neural Network models trained on stereo image data to predict depth per pixel. Fortunately, in some tas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 29 publications
(55 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?