Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) 2014
DOI: 10.3115/v1/d14-1086

ReferItGame: Referring to Objects in Photographs of Natural Scenes

Abstract: In this paper we introduce a new game to crowd-source natural language referring expressions. By designing a two player game, we can both collect and verify referring expressions directly within the game. To date, the game has produced a dataset containing 130,525 expressions, referring to 96,654 distinct objects, in 19,894 photographs of natural scenes. This dataset is larger and more varied than previous REG datasets and allows us to study referring expressions in real-world scenes. We provide an in depth an…

Cited by 828 publications (835 citation statements)
References 24 publications
“…As we are interested in tracking by natural language specification, we augment the videos in OTB100 with natural language descriptions of the target object. Following the guidelines in [19] we ask annotators for a discriminative referring description of the target. For fairness the annotators describe the target based on the first frame only.…”
Section: Datasets (mentioning)
confidence: 99%
“…ReferIt [19]. The ReferIt dataset is proposed in [19] for the task of object localization and segmentation by natural language expression.…”
Section: Datasets (mentioning)
confidence: 99%