Proceedings of the Main Conference on Human Language Technology Conference of the North American Chapter of the Association of 2006
DOI: 10.3115/1220835.1220874
|View full text |Cite
|
Sign up to set email alerts
|

Preemptive information extraction using unrestricted relation discovery

Abstract: We are trying to extend the boundary of Information Extraction (IE) systems. Existing IE systems require a lot of time and human effort to tune for a new scenario. Preemptive Information Extraction is an attempt to automatically create all feasible IE systems in advance without human intervention. We propose a technique called Unrestricted Relation Discovery that discovers all possible relations from texts and presents them as tables. We present a preliminary system that obtains reasonably good results.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
88
0
3

Year Published

2010
2010
2017
2017

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 148 publications
(91 citation statements)
references
References 7 publications
0
88
0
3
Order By: Relevance
“…The open information extraction paradigm, simultaneously proposed by Shinyama and Sekine (2006) and Banko et al (2007), does not rely on any labeled data or even existing relations. Instead, open information extraction systems only use an unlabeled corpus, and output a set of extracted relations.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The open information extraction paradigm, simultaneously proposed by Shinyama and Sekine (2006) and Banko et al (2007), does not rely on any labeled data or even existing relations. Instead, open information extraction systems only use an unlabeled corpus, and output a set of extracted relations.…”
Section: Related Workmentioning
confidence: 99%
“…Instead, open information extraction systems only use an unlabeled corpus, and output a set of extracted relations. Such systems are based on clustering (Shinyama and Sekine, 2006) or self-supervision (Banko et al, 2007). One of the limitations of these systems is the fact that they extract uncanonicalized relations.…”
Section: Related Workmentioning
confidence: 99%
“…One way to reason about KB and surface relations is to cluster the relations: whenever two relations appear in the same cluster, they are treated as synonymous (Hasegawa et al, 2004;Shinyama and Sekine, 2006;Yao et al, 2011;Takamatsu et al, 2011;Min et al, 2012;Akbik et al, 2012;de Lacalle and Lapata, 2013). For example, if "criticizes" and "hates" are clustered together, then we may predict "hates"("Dante", "Catholic Church") from the above fact (which is actually not true).…”
Section: Related Workmentioning
confidence: 99%
“…Following the idea of preemptive Information Extraction (Shinyama and Sekine, 2006), we pre-extract and store all subtrees and entity types from a given corpus for each sentence with at least two named entities. This allows not only fast retrieval of matching entity pairs for a given set of subtrees and type restrictions, but also allows us to compute pattern correlations over the entire dataset for the presently selected setup.…”
Section: Preemptive Pattern Extractionmentioning
confidence: 99%