2019
DOI: 10.1007/978-3-030-22744-9_16
|View full text |Cite
|
Sign up to set email alerts
|

Missing Features Reconstruction and Its Impact on Classification Accuracy

Abstract: In real-world applications, we can encounter situations when a well-trained model has to be used to predict from a damaged dataset. The damage caused by missing or corrupted values can be either on the level of individual instances or on the level of entire features. Both situations have a negative impact on the usability of the model on such a dataset. This paper focuses on the scenario where entire features are missing which can be understood as a specific case of transfer learning. Our aim is to experimenta… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 19 publications
0
3
0
Order By: Relevance
“…Additionally, the impact of missing data and imputation methods on the analysis of activity patterns underscores the importance of accurate imputation techniques [32]. Furthermore, the use of artificial neural networks for missing feature reconstruction highlights the relevance of advanced techniques in imputation [33]. Moreover, neural models have been employed for the imputation of missing ozone data, demonstrating the applicability of machine learning in addressing missing data in various domains [34].…”
Section: Data Imputation Methodsmentioning
confidence: 99%
“…Additionally, the impact of missing data and imputation methods on the analysis of activity patterns underscores the importance of accurate imputation techniques [32]. Furthermore, the use of artificial neural networks for missing feature reconstruction highlights the relevance of advanced techniques in imputation [33]. Moreover, neural models have been employed for the imputation of missing ozone data, demonstrating the applicability of machine learning in addressing missing data in various domains [34].…”
Section: Data Imputation Methodsmentioning
confidence: 99%
“…The abnormal values are first removed from the data set, and then, linear interpolation is used for imputation of the missing data. Previous studies have demonstrated that using linear interpolation method to impute the data set with missing data less than 5% can achieve almost the same model accuracy as the complete data set. The statistical descriptions of the data after cleaning are also shown in Tables S1 and S2. The data set of 27 months (4938 samples) is divided into 9 blocks on average, each of which has similar seasonal characteristics.…”
Section: Case Studymentioning
confidence: 99%
“…ANN may become trapped in a local minimum on large datasets [56]. XGBT is suitable for processing structured feature data and unstructured data, which is not a good processing ability for unstructured data [57]. However, machine learning models are black box models.…”
Section: The Differences and Shortcomings Of The Machine Learning Modelsmentioning
confidence: 99%