2015
DOI: 10.1007/978-3-319-25639-9_38
|View full text |Cite
|
Sign up to set email alerts
|

Sorted Neighborhood for Schema-Free RDF Data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
3
1
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(6 citation statements)
references
References 14 publications
0
6
0
Order By: Relevance
“…Note that the recall metric is computed differently in Table 2. Specifically, the number of true positives in the retrieved 500 duplicates is divided by the quantity min(500, |Ω m |), where |Ω m | is the actual number of matching entities ( Table 1), instead of |Ω m | (as with traditional recall computation 36 ). The table shows that the proposed TSG equals or outperforms the 36 The reason for bounding the denominator in this particular experiment is to prevent the recall from exceeding 100%.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Note that the recall metric is computed differently in Table 2. Specifically, the number of true positives in the retrieved 500 duplicates is divided by the quantity min(500, |Ω m |), where |Ω m | is the actual number of matching entities ( Table 1), instead of |Ω m | (as with traditional recall computation 36 ). The table shows that the proposed TSG equals or outperforms the 36 The reason for bounding the denominator in this particular experiment is to prevent the recall from exceeding 100%.…”
Section: Methodsmentioning
confidence: 99%
“…In a preliminary workshop report, we demonstrated that the empirical benefits of relational DNF blocking schemes can also be realized on schema-free RDF data, if the formalism is appropriately adopted [35]. In follow-up work, we showed that the schemes can be used with numerous blocking algorithms, including an adapted version of the classic Sorted Neighborhood algorithm [36]. This article comprehensively develops DNF blocking scheme learning on RDF data, and uses it to propose a novel Set Covering-based learning algorithm with convenient theoretical properties (Section 4.5.1).…”
Section: Supervised Systemsmentioning
confidence: 99%
“…A commonly used schema-agnostic blocking approach is Token-Blocking [34,36,37,51,52] in which individual records are split into bags of all possible tokens (individual words) and grouped by their commonly shared tokens. Figure 4.5 depicts Token-Blocking applied to the same records of Fig.…”
Section: Schema-agnostic Blocking Approachesmentioning
confidence: 99%
“…In [36], the authors note that Sorted Neighbourhood assumes that a sorting key can be applied to all records of a dataset (or datasets) as they all share the same attributes (i.e. schema).…”
Section: Sorted Neighbourhoodmentioning
confidence: 99%
“…We presented a second algorithm that works along similar lines but uses a different, lowercomplexity, matching algorithm instead [Kejriwal and Miranker 2014]. A second approach to addressing multiple types, which has been gaining traction recently, is to devise schema-free algorithms that do not need to match types before performing instance matching [Rong et al 2012], [Kejriwal and Miranker 2015b].…”
Section: Type Alignmentmentioning
confidence: 99%