2005
DOI: 10.1007/11527503_69
|View full text |Cite
|
Sign up to set email alerts
|

An Analysis of Missing Data Treatment Methods and Their Application to Health Care Dataset

Abstract: Abstract.It is well accepted that many real-life datasets are full of missing data. In this paper we introduce, analyze and compare several well known treatment methods for missing data handling and propose new methods based on Naive Bayesian classifier to estimate and replace missing data. We conduct extensive experiments on datasets from UCI to compare these methods. Finally we apply these models to a geriatric hospital dataset in order to assess their effectiveness on a real-life dataset.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2006
2006
2022
2022

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 13 publications
(9 citation statements)
references
References 5 publications
0
8
0
Order By: Relevance
“…However, low applicability raises the issue of dealing with missing data, increasingly discussed in the literature [ 29 ]. In order to avoid loss of power we applied a combination of two popular methods: case deletion and constant replacement [ 30 ].…”
Section: Discussionmentioning
confidence: 99%
“…However, low applicability raises the issue of dealing with missing data, increasingly discussed in the literature [ 29 ]. In order to avoid loss of power we applied a combination of two popular methods: case deletion and constant replacement [ 30 ].…”
Section: Discussionmentioning
confidence: 99%
“…A recent application in which incomplete data appear is biology research with DNA microarrays, where the gene data may be missing due to various reasons such as scratch on the slide or contaminated samples [20,21]. Missing values are common in medical diagnosis, e.g., a medical practitioner may not order a test whose outcome appears certain or not relevant to the diagnosis, or a feature can be missing because it proved to be difficult/harmful to measure [22][23][24][25].…”
Section: Missing Data Mechanismsmentioning
confidence: 99%
“…Imputation technique is one of the widely used missing data treatment methods [11]. The basic idea of Naive Bayesian Imputation (NBI) is first to define the feature to be imputed, called 'imputation feature' and then construct the NBC using the imputation feature as the class feature.…”
Section: Nbi Modelsmentioning
confidence: 99%