2015 International Conference on Pervasive Computing (ICPC) 2015
DOI: 10.1109/pervasive.2015.7087099
|View full text |Cite
|
Sign up to set email alerts
|

A survey analysis on duplicate detection in Hierarchical Data

Abstract: Electronic Data Processing used automated methods for processing commercial data. There is big amount of work on discovering duplicates in relational data; merely elite findings concentrate on duplication in additional multifaceted hierarchical structures. Electronic information is one of the key factors in several business operations, applications, and determinations, at the same time as an outcome, guarantee its superiority is necessary. Data superiority, on the other hand, can be adjusted by different kind … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 9 publications
(3 reference statements)
0
2
0
Order By: Relevance
“…Yumusak, Dogdu, and Kodaz (2014) , present a brief survey dealing with linked data ranking, classifying methods in: ontology ranking, RDF (Resource Description Framework) document ranking, graph ranking, entity ranking and document/source ranking. Gaikwad and Bogiri (2015) , present a survey analysis on duplicated detection in the domain of hierarchical data. They have oriented the paper to experts who are doing research in duplicate detection in xml data or hierarchical data.…”
Section: Related Workmentioning
confidence: 99%
“…Yumusak, Dogdu, and Kodaz (2014) , present a brief survey dealing with linked data ranking, classifying methods in: ontology ranking, RDF (Resource Description Framework) document ranking, graph ranking, entity ranking and document/source ranking. Gaikwad and Bogiri (2015) , present a survey analysis on duplicated detection in the domain of hierarchical data. They have oriented the paper to experts who are doing research in duplicate detection in xml data or hierarchical data.…”
Section: Related Workmentioning
confidence: 99%
“…ER is a significant and common data cleaning problem, and it consists of detecting data duplicate representations for the same external entities, and merging them into single representations [ 43 ]. This problem can be applied to many different domains, such as deduplication in databases [ 44 ], duplicate detection in data or hierarchical data [ 45 ], cross-document co-reference resolution methods and tools [ 46 ], blocking techniques [ 43 , 47 ], bug reports [ 48 ], customer recognition [ 31 ], and E-health [ 49 ]. Most of the existing studies have been validated using real-world datasets, but very few of them have applied their proposal in a real case in the industry [ 42 ].…”
Section: Introductionmentioning
confidence: 99%