2015
DOI: 10.14778/2809974.2809989
|View full text |Cite
|
Sign up to set email alerts
|

Enriching data imputation with extensive similarity neighbors

Abstract: Incomplete information often occur along with many database applications, e.g., in data integration, data cleaning or data exchange. The idea of data imputation is to fill the missing data with the values of its neighbors who share the same information. Such neighbors could either be identified certainly by editing rules or statistically by relational dependency networks. Unfortunately, owing to data sparsity, the number of neighbors (identified w.r.t. value equality) is rather limited, especially in the prese… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 41 publications
(23 citation statements)
references
References 19 publications
0
21
0
Order By: Relevance
“…A lot of work deals with queries that are incomplete with respect to missing attribute values in the entries of the (otherwise complete) result set [2,9,14,17,19,21]. A common solution to resolve this kind of incompleteness, as well as fuzzy searches over hidden databases in general, builds on query refactoring [14,17,21] and educated guessing of values [2,9,19]. However, such approaches are not usable in our problem context, since our notion of incomplete queries relates to restrictions of the result set size.…”
Section: Related Workmentioning
confidence: 99%
“…A lot of work deals with queries that are incomplete with respect to missing attribute values in the entries of the (otherwise complete) result set [2,9,14,17,19,21]. A common solution to resolve this kind of incompleteness, as well as fuzzy searches over hidden databases in general, builds on query refactoring [14,17,21] and educated guessing of values [2,9,19]. However, such approaches are not usable in our problem context, since our notion of incomplete queries relates to restrictions of the result set size.…”
Section: Related Workmentioning
confidence: 99%
“…The function determines the structural similarity among the target and , the higher the numerical value is, a more closer structural description of instance is with description [ 31 , 32 ]. As a result, structural attributes are suggested for a tuple with missing attributes.…”
Section: Methodsmentioning
confidence: 99%
“…Differential dependency (DD) [32] is a valuable tool for data imputation [34], data cleaning [28], data repairing [33], and so on. Song et al [34] used the DDs to fill the missing attributes of incomplete objects on static data set via some detected neighbors satisfying the distance constraints on determinant attributes. Song et al [33,36] also explored to repair labels of graph nodes.…”
Section: Related Workmentioning
confidence: 99%