2009
DOI: 10.1007/s00778-009-0156-z
|View full text |Cite
|
Sign up to set email alerts
|

Qualitative effects of knowledge rules and user feedback in probabilistic data integration

Abstract: In data integration efforts, portal development in particular, much development time is devoted to entity resolution. Often advanced similarity measurement techniques are used to remove semantic duplicates or solve other semantic conflicts. It proves impossible, however, to automatically get rid of all semantic problems. An often-used rule of thumb states that about 90% of the development effort is devoted to semi-automatically resolving the remaining 10% hard cases. In an attempt to significantly decrease hum… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
31
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
5
2
1

Relationship

3
5

Authors

Journals

citations
Cited by 42 publications
(31 citation statements)
references
References 29 publications
0
31
0
Order By: Relevance
“…Generating more XPaths may improve the ability to find the most suitable XPath. Also, the use of a probabilistic database approach may be able to more robustly address ambiguous situations [17,18].…”
Section: Discussionmentioning
confidence: 99%
“…Generating more XPaths may improve the ability to find the most suitable XPath. Also, the use of a probabilistic database approach may be able to more robustly address ambiguous situations [17,18].…”
Section: Discussionmentioning
confidence: 99%
“…-A developer gradually and interactively defines an ontology with positive and negative knowledge about the correctness of certain (combinations of) annotations. At each iteration, added knowledge is immediately applied improving the extraction result until the result is good enough (see also [17]). -Storage, querying and manipulation of annotations should be scalable.…”
Section: Future Research Directionsmentioning
confidence: 99%
“…To represent the result of the integration, we need a way to capture the uncertainty in the schema mappings, in deduplication, or in resolving conflicting information. This uncertainty can be characterized by probabilistic mappings [26] and probabilistic data integration rules [38,39]. The outcome of the integration process can naturally be viewed as probabilistic XML (which is useful to query, update, and so on).…”
Section: Probabilistic Xml Applicationsmentioning
confidence: 99%