2012 IEEE 28th International Conference on Data Engineering 2012
DOI: 10.1109/icde.2012.20
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive Windows for Duplicate Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
41
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 60 publications
(41 citation statements)
references
References 21 publications
(18 reference statements)
0
41
0
Order By: Relevance
“…On the other hand, unnecessary comparison carried out when window size too large. To achieve effectiveness adaptive window size is used [3], [5], [9], [11]. In order to make duplication detection solution applicable, consider that adaptively plays important role.…”
Section: Sorted Neighborhood Methode and Windowingmentioning
confidence: 99%
See 1 more Smart Citation
“…On the other hand, unnecessary comparison carried out when window size too large. To achieve effectiveness adaptive window size is used [3], [5], [9], [11]. In order to make duplication detection solution applicable, consider that adaptively plays important role.…”
Section: Sorted Neighborhood Methode and Windowingmentioning
confidence: 99%
“…So the overall number of comparisons is getting reduced. In the past years numbers of blocking algorithms have been proposed by researchers [10], [11], [12], [13], [14]. These techniques typically form blocks or groups of observations using sorting or indexing.…”
Section: Blockingmentioning
confidence: 99%
“…Indexing techniques, discussed in more detail later, range from simple phonetic based blocking [4] and sorting of the datasets [11] to locality sensitive hashing based techniques [18,29], and unsupervised [17,26] and supervised [1,22] learning of optimal blocking schemes.…”
Section: Introductionmentioning
confidence: 99%
“…The recognition algorithm is composed of SNM algorithm [3], MPN algorithm [4] and KNN algorithm [5]. The SNM algorithm is a standard way to detect the similar duplicate records.…”
Section: Introductionmentioning
confidence: 99%