Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management 2014
DOI: 10.1145/2661829.2662073
|View full text |Cite
|
Sign up to set email alerts
|

Canonicalizing Open Knowledge Bases

Abstract: Open information extraction approaches have led to the creation of large knowledge bases from the Web. The problem with such methods is that their entities and relations are not canonicalized, leading to redundant and ambiguous facts. For example, they may store Barack Obama, was born in, Honolulu and Obama, place of birth, Honolulu . In this paper, we present an approach based on machine learning methods that can canonicalize such Open IE triples, by clustering synonymous names and phrases.We also provide a d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
87
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 93 publications
(92 citation statements)
references
References 20 publications
(18 reference statements)
1
87
0
Order By: Relevance
“…However, OIE extractions provide a suitable starting point which is exploited by ENTICE. (Galárraga et al, 2014) addresses the problem of normalizing (or canonicalizing) OIE extractions which can be considered as one of the components of ENTICE (see Section 3.3).…”
Section: Related Workmentioning
confidence: 99%
“…However, OIE extractions provide a suitable starting point which is exploited by ENTICE. (Galárraga et al, 2014) addresses the problem of normalizing (or canonicalizing) OIE extractions which can be considered as one of the components of ENTICE (see Section 3.3).…”
Section: Related Workmentioning
confidence: 99%
“…There has been some studies [6, 14] on clustering synonymous relation phrases based on different kinds of signals and clustering methods (see Sec. 6).…”
Section: Clustering-integrated Type Propagation On Graphsmentioning
confidence: 99%
“…In particular, type signatures of relation phrases have proven very useful in clustering of relation phrases which have infrequent or ambiguous strings and contexts [6]. In contrast to previous approaches, our method leverages the type information derived by the type propagation and thus does not rely strictly on external sources to determine the type information for all the entity arguments.…”
Section: Clustering-integrated Type Propagation On Graphsmentioning
confidence: 99%
“…In [4], we canonicalize the relations of a significant percentage (up to 33%) of the triples of a set of 1.3M Reverb extractions from ClueWeb09 3 . Our method assumes subjects and objects have been canonicalized somehow, for example by linking them to Freebase using the method described in [7].…”
Section: Canonicalization Of Open Kbsmentioning
confidence: 99%