Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2016
DOI: 10.18653/v1/p16-1058
|View full text |Cite
|
Sign up to set email alerts
|

Easy Things First: Installments Improve Referring Expression Generation for Objects in Photographs

Abstract: Research on generating referring expressions has so far mostly focussed on "oneshot reference", where the aim is to generate a single, discriminating expression. In interactive settings, however, it is not uncommon for reference to be established in "installments", where referring information is offered piecewise until success has been confirmed. We show that this strategy can also be advantageous in technical systems that only have uncertain access to object attributes and categories. We train a recently intr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
30
0

Year Published

2016
2016
2018
2018

Publication Types

Select...
4
1

Relationship

3
2

Authors

Journals

citations
Cited by 14 publications
(30 citation statements)
references
References 31 publications
0
30
0
Order By: Relevance
“…Additional evaluation metrics, such as success rates in a human evaluation (cf. Zarrieß and Schlangen (2016)), would be an interesting direction for more detailed investigation here.…”
Section: Word Similarities Many Of the Examples Inmentioning
confidence: 98%
See 3 more Smart Citations
“…Additional evaluation metrics, such as success rates in a human evaluation (cf. Zarrieß and Schlangen (2016)), would be an interesting direction for more detailed investigation here.…”
Section: Word Similarities Many Of the Examples Inmentioning
confidence: 98%
“…These features are then associated in a learning process with certain words, resulting in an association of colour features with colour words, spatial features with prepositions, etc., and based on this, these words can be interpreted with reference to the scene currently presented to the video feed. Whereas Roy's work still looked at relatively simple scenes with graphical objects, research on REG has recently started to investigate set-ups based on real-world images (Kazemzadeh et al, 2014;Gkatzia et al, 2015;Zarrieß and Schlangen, 2016;Mao et al, 2015). Importantly, the lowlevel visual features that can be extracted from these scenes correspond less directly to particular word classes.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…To factor out effects of compositionality and context that arise in reference generation or resolution, we measure how well a predictor for a word w is able to retrieve from a sampled test set objects that have been referred to by w (Schlangen et al, 2016;Zarrieß and Schlangen, 2016a) evaluate on full referring expressions).…”
Section: Experimental Set-upmentioning
confidence: 99%