2020
DOI: 10.1109/access.2020.2994033
|View full text |Cite
|
Sign up to set email alerts
|

Iterative Robust Semi-Supervised Missing Data Imputation

Abstract: In many real-world applications scientists are often confronted with the problem of incomplete datasets due to several reasons. The direct analysis of datasets with missing values in attributes inevitably results in inaccurate learning models and erroneous results. Facing effectively the challenge of missing values is an essential step of the data mining process. Imputation is often employed to overcome the shortcomings incurred by missing data during the pre-process stage of data analysis. Therefore, a pletho… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8
2

Relationship

2
8

Authors

Journals

citations
Cited by 23 publications
(11 citation statements)
references
References 50 publications
(84 reference statements)
0
11
0
Order By: Relevance
“…As future work, at first, it would be beneficial to apply different techniques for handling of missing values such as [82] and experiment with even more feature selection techniques. Moreover, it would be interesting to evaluate the impact of dimentionality reduction with techniques such as principal component analysis [83] in T2DM prediction This work is licensed under a Creative Commons Attribution 4.0 License.…”
Section: Discussionmentioning
confidence: 99%
“…As future work, at first, it would be beneficial to apply different techniques for handling of missing values such as [82] and experiment with even more feature selection techniques. Moreover, it would be interesting to evaluate the impact of dimentionality reduction with techniques such as principal component analysis [83] in T2DM prediction This work is licensed under a Creative Commons Attribution 4.0 License.…”
Section: Discussionmentioning
confidence: 99%
“…Second, it is based on machine learning. Third, it is based on deep learning [6]. Statistical analysis methods include mean imputation, regression imputation, hot deck imputation, multiple imputation, and multiple implications by chained equations (MICE).…”
Section: Background Theorymentioning
confidence: 99%
“…Considering that the quality of data is essential for building effective and robust ML models [27], a preprocess analysis was performed for cleaning and preparing the data before applying a ML algorithm. For this purpose, the missing values of the numerical attributes were imputed employing the mean imputation method.…”
Section: Data Descriptionmentioning
confidence: 99%