2010
DOI: 10.1007/s10489-009-0207-6
|View full text |Cite
|
Sign up to set email alerts
|

Shell-neighbor method and its application in missing data imputation

Abstract: Data preparation is an important step in mining incomplete data. To deal with this problem, this paper introduces a new imputation approach called SN (Shell Neighbors) imputation, or simply SNI. The SNI fills in an incomplete instance (with missing values) in a given dataset by only using its left and right nearest neighbors with respect to each factor (attribute), referred them to Shell Neighbors. The left and right nearest neighbors are selected from a set of nearest neighbors of the incomplete instance. The… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

1
53
0
2

Year Published

2012
2012
2018
2018

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 111 publications
(56 citation statements)
references
References 24 publications
1
53
0
2
Order By: Relevance
“…The simplest way is imputation with constant values, zeros, random values, mean values (over all data set [18], over the class of the data item [9]). The more-sophisticated techniques are nearestneighbor selection [36,37], Expectation-Maximization (EM) algorithm [8] or hot-deck [23] and cold-deck [13] techniques to avoid imputation of non-existing values.…”
Section: Imputationmentioning
confidence: 99%
“…The simplest way is imputation with constant values, zeros, random values, mean values (over all data set [18], over the class of the data item [9]). The more-sophisticated techniques are nearestneighbor selection [36,37], Expectation-Maximization (EM) algorithm [8] or hot-deck [23] and cold-deck [13] techniques to avoid imputation of non-existing values.…”
Section: Imputationmentioning
confidence: 99%
“…Non-parametric method is applied when the relationship between the conditional www.ijacsa.thesai.org attributes is unknown. Parametric methods like Nearest Neighbour [4][10] [25] have been used for the prediction of missing attribute(s). Non-parametric technique such as empirical likelihood [32], clustering [26], Semi-parametric techniques [21] [33] have also been applied for missing data imputation.…”
Section: Related Workmentioning
confidence: 99%
“…It exploits the k relevant instances of the data and provides simplicity, ease of implementation and achieved high accuracy. [9]. Also, the fuzzy rule based is widely in the data imputation method which sculpts the linguistic model structure which has the tendency to evaluate the value of missing data and mitigates the dimension reduction problem [10].…”
Section: Introductionmentioning
confidence: 99%