2020
DOI: 10.1002/cpe.5746
|View full text |Cite
|
Sign up to set email alerts
|

Data linking over RDF knowledge graphs: A survey

Abstract: Summary Instance matching (IM) is the process of matching instances across Knowledge Bases (KBs) that refer to the same real‐world object (eg, the same person in two different KBs). Several approaches in the literature were developed to perform this process using different algorithmic techniques and search strategies. In this article, we aim to provide the rationale for IM and to survey the existing algorithms for performing this task. We begin by identifying the importance of such a process and define it form… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 105 publications
0
6
0
Order By: Relevance
“…RDF stores have become increasingly popular as a means to make knowledge available on the web (Assi et al, 2020). We propose an automatic metric for assessing the entity-based semantic adequacy of RDF verbalisers and show that it is effective in highlighting semantic inadequacy even for state-ofthe-art models with high BLEU scores.…”
Section: Discussionmentioning
confidence: 99%
“…RDF stores have become increasingly popular as a means to make knowledge available on the web (Assi et al, 2020). We propose an automatic metric for assessing the entity-based semantic adequacy of RDF verbalisers and show that it is effective in highlighting semantic inadequacy even for state-ofthe-art models with high BLEU scores.…”
Section: Discussionmentioning
confidence: 99%
“…In each iteration, we categorized the tokens into different themes based on their authors' descriptions to refine our classification. Some tokens were found in multiple clusters in subsequent iterations -as in the case where Assi et al [26] mentioned that "the incorrectness simply refers to the data typographical errors"-which can affect the value of a predicate as well as on the predicate itself. As a result of the first iteration, we were unable to classify some heterogeneities, which prompted us to establish a fifth primary level called "Problem at graph level".…”
Section: An Iterative Methodologymentioning
confidence: 99%
“…* Data quality dimension: for the problems of transgression of good practice, heterogeneity of value type or non-updated dataset. • Assi et al [26] address the issue of IM. They introduce the scalability problem when it comes to IM on large datasets.…”
Section: Review Of Articles Addressing Different Heterogeneity Issuesmentioning
confidence: 99%
See 1 more Smart Citation
“…Tasks such as data cleaning, resolving inconsistencies, and standardizing formats can be undertaken during this stage. Semantic technologies like RDF (Resource Description Framework) (Assi, Mcheick, and Dhifli 2020) and OWL (Web Ontology Language) (Chen, Jia, and Xiang 2020) may also help in facilitating this process as they structure the data into a format that is machine-readable.…”
Section: E-liability Knowledge Graph Data Acquisition and Processingmentioning
confidence: 99%