2020
DOI: 10.1186/s40537-020-00313-w
|View full text |Cite
|
Sign up to set email alerts
|

SICE: an improved missing data imputation technique

Abstract: In data analytics, missing data is a factor that degrades performance. Incorrect imputation of missing values could lead to a wrong prediction. In this era of big data, when a massive volume of data is generated in every second, and utilization of these data is a major concern to the stakeholders, efficiently handling missing values becomes more important. In this paper, we have proposed a new technique for missing data imputation, which is a hybrid approach of single and multiple imputation techniques. We hav… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
77
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 146 publications
(99 citation statements)
references
References 28 publications
0
77
0
Order By: Relevance
“…However, simple imputation methods may produce bias or unrealistic results on a high-dimensional data sets. Also, with the generation of big data emerging, this method seems to be performing poorly and therefore is inadequate to be implemented on such data sets [42].…”
Section: Simple Imputationmentioning
confidence: 99%
See 1 more Smart Citation
“…However, simple imputation methods may produce bias or unrealistic results on a high-dimensional data sets. Also, with the generation of big data emerging, this method seems to be performing poorly and therefore is inadequate to be implemented on such data sets [42].…”
Section: Simple Imputationmentioning
confidence: 99%
“…However, their multiple imputation technique performed better than the other conventional methods. There has been a study also by [42], that explored a multiple imputation approach that extended multivariate imputation by chained equation for big data. The approach had presented two variants one for categorical and the other numeric data and implemented twelve existing algorithms for performance comparison.…”
Section: Multiple Imputationmentioning
confidence: 99%
“…Multiple imputation using linear regression generates imputations by building a model from observed data and predicting the missing values from the fitted model using the spread around the fitted linear regression line of y given x, as fitted on the observed data [1,57]. Here, the analysis is performed by point estimates to find the single best value around the regression line [68].…”
Section: Monotone Linear Regression (Lr) Yesmentioning
confidence: 99%
“…Multiple imputation using Bayesian linear regression [70] is much like linear regression. However, the imputation is done within the context of Bayesian inference where the missing values are drawn from a Bayesian posterior predictive distribution for the observed data [68,71]. Thus BLR seeks to find out the posterior distribution for the model parameters rather than finding a single best value [68].…”
Section: Predictive Mean Matching (Pmm) Yesmentioning
confidence: 99%
See 1 more Smart Citation