Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing 2016
DOI: 10.18653/v1/d16-1199
|View full text |Cite
|
Sign up to set email alerts
|

Learning to Answer Questions from Wikipedia Infoboxes

Abstract: A natural language interface to answers on the Web can help us access information more efficiently. We start with an interesting source of information-infoboxes in Wikipedia that summarize factoid knowledge-and develop a comprehensive approach to answering questions with high precision. We first build a system to access data in infoboxes in a structured manner. We use our system to construct a crowdsourced dataset of over 15,000 highquality, diverse questions. With these questions, we train a convolutional neu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 13 publications
(11 citation statements)
references
References 13 publications
0
11
0
Order By: Relevance
“…educated at(T uring, P rinceton)), collecting roughly 100,000 natural-language questions to support QA against a knowledge graph. Morales et al (2016) used a similar process to collect questions from Wikipedia infoboxes, yielding the 15,000-example InfoboxQA dataset. For the task of identifying predicate-argument structures, QA-SRL (He et al, 2015) was proposed as an open schema for semantic roles, in which the relation between an argument and a predicate is expressed as a natural-language question containing the predicate ("Where was someone educated?")…”
Section: Negative Examplesmentioning
confidence: 99%
“…educated at(T uring, P rinceton)), collecting roughly 100,000 natural-language questions to support QA against a knowledge graph. Morales et al (2016) used a similar process to collect questions from Wikipedia infoboxes, yielding the 15,000-example InfoboxQA dataset. For the task of identifying predicate-argument structures, QA-SRL (He et al, 2015) was proposed as an open schema for semantic roles, in which the relation between an argument and a predicate is expressed as a natural-language question containing the predicate ("Where was someone educated?")…”
Section: Negative Examplesmentioning
confidence: 99%
“…WIKIPEDIA contains an abundance of humancurated, multi-domain information and has several structured resources such as infoboxes and WIKIDATA (Vrandečić, 2012) associated with it. WIKIPEDIA has thus been used for a wealth of research to build datasets posing queries about a single sentence (Morales et al, 2016;Levy et al, 2017) or article (Yang et al, 2015;Hewlett et al, 2016;Rajpurkar et al, 2016). However, no attempt has been made to construct a cross-document multi-step RC dataset based on WIKIPEDIA.…”
Section: Wikihopmentioning
confidence: 99%
“…Articles are organized in hierarchical sections, and many have an "infobox," a table that summarizes key information in the article. To access these kinds of information, we developed WikipediaBase (Morales, 2016), a system that turns Wikipedia into a virtual database and organizes it in an object-property-value data model. We consider infobox attributes and section headers to be properties.…”
Section: Start Parses These Annotations and Stores The Parsed Structumentioning
confidence: 99%
“…To address these types of questions, we compiled a crowdsourced corpus of over 15,000 questions about Wikipedia infoboxes. We used these questions to train a machine learning model that selects the correct response from a set of candidate answers with high accuracy (Morales, 2016;Morales, Premtoon, Avery, Felshin, & Katz, 2016). Our ongoing work in automatic techniques to answer questions will allow the START system to quickly scale up to new types of questions and information sources.…”
Section: Start Parses These Annotations and Stores The Parsed Structumentioning
confidence: 99%