Proceedings of the 23rd International Conference on World Wide Web 2014
DOI: 10.1145/2567948.2578999
|View full text |Cite
|
Sign up to set email alerts
|

Learning conflict resolution strategies for cross-language Wikipedia data fusion

Abstract: In order to efficiently use the ever growing amounts of structured data on the web, methods and tools for quality-aware data integration should be devised. In this paper we propose an approach to automatically learn the conflict resolution strategies, which is a crucial step in large-scale data integration. The approach is implemented as an extension of the Sieve data quality assessment and fusion framework. We apply and evaluate our approach on the use case of fusing data from 10 language editions of DBpedia,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
33
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
4
2
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 33 publications
(33 citation statements)
references
References 12 publications
0
33
0
Order By: Relevance
“…One of such solutions is DBpedia. The problem that is often encountered is a conflict resolution, which occurs when various language versions concerning the same subject have conflicting information [21]. Our quality metrics can help in building more effective conflict resolution strategies for data fusing.…”
Section: Discussionmentioning
confidence: 99%
“…One of such solutions is DBpedia. The problem that is often encountered is a conflict resolution, which occurs when various language versions concerning the same subject have conflicting information [21]. Our quality metrics can help in building more effective conflict resolution strategies for data fusing.…”
Section: Discussionmentioning
confidence: 99%
“…Recently, much work in Natural Language Processing focused on Knowledge Base Completion (Nickel et al, 2016a, KBC), the task of enriching and refining existing KBs. Many different methods have been explored for KBC, including exploitation of resources such as text corpora (Snow et al, 2006;Mintz et al, 2009;Aprosio et al, 2013) or other KBs Bryl and Bizer, 2014) for acquiring additional knowledge. Alternative approaches, in contrast, primarily rely on existing information from the KB itself (Socher et al, 2013;Nickel et al, 2016b) used as ground-truth to simultaneously learn continuous representations of KB concepts and relations, which are used to infer additional KB relations.…”
Section: Related Workmentioning
confidence: 99%
“…External KBC approaches use outer knowledge like text corpora Aprosio et al, 2013) or other KBs Bryl and Bizer, 2014) for acquiring additional knowledge. The text-based external methods typically employ a form of a distant supervi-sion.…”
Section: Related Workmentioning
confidence: 99%