2016
DOI: 10.15587/1729-4061.2016.86243
|View full text |Cite
|
Sign up to set email alerts
|

Detection of near dublicates in tables based on the locality-sensitive hashing method and the nearest neighbor method

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
17
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
3
2
2

Relationship

2
5

Authors

Journals

citations
Cited by 21 publications
(17 citation statements)
references
References 12 publications
0
17
0
Order By: Relevance
“…This technique is based on the assumption that the abstracts of scientific publications that relate to the same scientific direction will contain the same concepts and keywords, that is, they will be quite similar in content. The method for determining closeness between fragments of text information, but when applied to the task on finding incomplete duplicates, was described in papers [5,6].…”
Section: A Methods For the Clustering Of Publications Of Scientists Bymentioning
confidence: 99%
See 3 more Smart Citations
“…This technique is based on the assumption that the abstracts of scientific publications that relate to the same scientific direction will contain the same concepts and keywords, that is, they will be quite similar in content. The method for determining closeness between fragments of text information, but when applied to the task on finding incomplete duplicates, was described in papers [5,6].…”
Section: A Methods For the Clustering Of Publications Of Scientists Bymentioning
confidence: 99%
“…Using a method of locally-sensitive hashing, in accordance with the method described in paper [5], we shall determine index elements:…”
Section: A Methods For the Clustering Of Publications Of Scientists Bymentioning
confidence: 99%
See 2 more Smart Citations
“…In particular, authors of work [18] have developed a method for detecting incomplete duplicates in tables which is based on the methods of the nearest neighbor and locally sensitive hashing. In work [19], a conceptual model of the system for finding incomplete duplicates using identification of similarities in electronic documents is described.…”
Section: Literature Review and Problem Statementmentioning
confidence: 99%