Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
DOI: 10.18653/v1/n18-2124
Visually Guided Spatial Relation Extraction from Text

Abstract: Extraction of spatial relations from sentences with complex/nested relationships is very challenging, as it often requires resolving inherent semantic ambiguities. We seek help from the visual modality to fill the information gap in the text modality and resolve spatial semantic ambiguities. We use various recent vision-and-language datasets and techniques to train inter-modality alignment models and visual relationship classifiers, and propose a novel global inference model to integrate these components into our structured…
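As a rough sketch of that integration idea (not the paper's actual code; the blending function, weights, and scores below are assumptions), visual relationship scores computed on aligned image regions could be combined with text-only classifier scores for the same candidate relation:

```python
# Hypothetical sketch: fusing textual and visual evidence for one
# candidate spatial relation (trajector, indicator, landmark).
# The linear blend and the alpha weight are illustrative assumptions.

def combined_relation_score(text_score, visual_score, alpha=0.5):
    """Blend a text-only classifier score with a visual relationship
    classifier score for the same candidate relation."""
    return alpha * text_score + (1.0 - alpha) * visual_score

# Example: the text classifier is fairly confident; the visual
# classifier, run on the aligned image regions, agrees weakly.
print(combined_relation_score(text_score=0.8, visual_score=0.55))  # 0.675
```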

Cited by 6 publications (7 citation statements; 0 supporting, 7 mentioning, 0 contrasting). References 21 publications.
“…The code is publicly available here 2 . We compare our approach with the state-of-the-art (Rahgooy et al., 2018). However, in the mentioned paper, the authors use visual data from the accompanying images to improve the models.…”
Section: Results (mentioning)
Confidence: 99%
“…Spatial semantics is very closely connected and relevant to visualization of natural language and grounding language into perception, central to dealing with configurations in the physical world and motivating a combination of vision and language for richer spatial understanding. The related tasks include: text-to-scene conversion; image captioning; spatial and visual question answering; and spatial understanding in multimodal settings (Rahgooy et al., 2018) for robotics and navigation tasks and language grounding (Thomason et al., 2018).…”
Section: Description (mentioning)
Confidence: 99%
“…This function is a linear discriminant function defined over a combined feature representation of inputs and outputs, denoted by f(x, y). However, in this work, independent classifiers are trained per role and relation, and only the prediction is performed based on the global inference, as in (Kordjamshidi et al., 2017a; Rahgooy et al., 2018).…”
Section: Learning Model (mentioning)
Confidence: 99%
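For intuition, a linear discriminant of the form f(x, y) = w · φ(x, y) can be sketched as below; the toy feature map φ, the random weights, and the candidate set are placeholders, not the trained models of the cited work:

```python
import numpy as np

def phi(x, y):
    """Toy joint feature map over an input x and a candidate output y:
    the outer product of the two feature vectors, flattened."""
    return np.outer(x, y).ravel()

def f(w, x, y):
    """Linear discriminant: score of candidate output y for input x."""
    return w @ phi(x, y)

rng = np.random.default_rng(0)
x = rng.normal(size=4)        # input features (e.g., for one phrase)
candidates = list(np.eye(3))  # one-hot candidates (e.g., role labels)
w = rng.normal(size=4 * 3)    # weights (random here; learned in practice)

# An independent classifier would take this argmax per decision; in the
# cited approach the final prediction is instead taken jointly, under
# global constraints over all roles and relations.
best = max(candidates, key=lambda y: f(w, x, y))
print(best)
```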
“…The global constraints used in our proposed model are a combination of the previously proposed constraints (1-7) (Rahgooy et al., 2018) and a new one (constraint 8), described in Table 3.3. In fact, the global inference is performed using integer linear programming techniques subject to these constraints.…”
Section: Constraints (mentioning)
Confidence: 99%
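A minimal sketch of such ILP-based global inference, using the PuLP library, is shown below; the candidate roles, their scores, and the at-most-one-relation constraint are invented for illustration and are not the constraints (1-8) of the cited model:

```python
from pulp import LpProblem, LpMaximize, LpVariable, lpSum

# Toy candidates: local classifier scores for spatial roles and for two
# candidate relations that share a trajector and an indicator.
role_scores = {"tr1": 0.9, "sp1": 0.8, "lm1": 0.7, "lm2": 0.2}
rel_scores = {("tr1", "sp1", "lm1"): 0.6, ("tr1", "sp1", "lm2"): 0.4}

prob = LpProblem("spatial_global_inference", LpMaximize)
role = {r: LpVariable(f"role_{r}", cat="Binary") for r in role_scores}
rel = {t: LpVariable(f"rel_{i}", cat="Binary")
       for i, t in enumerate(rel_scores)}

# Objective: total score of the selected roles and relations.
prob += (lpSum(role_scores[r] * role[r] for r in role_scores)
         + lpSum(rel_scores[t] * rel[t] for t in rel_scores))

# Consistency: a relation may be selected only if all three of its
# arguments are also selected as roles.
for (tr, sp, lm), v in rel.items():
    prob += v <= role[tr]
    prob += v <= role[sp]
    prob += v <= role[lm]

# Illustrative mutual-exclusivity constraint (an assumption for this
# toy example): select at most one relation overall.
prob += lpSum(rel.values()) <= 1

prob.solve()
print([t for t, v in rel.items() if v.value() == 1])
```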