Using Entropy to Impute Missing Data in a Classification Task

Delavallade, Thomas; Dang, Thanh Ha

doi:10.1109/fuzzy.2007.4295430

Cited by 12 publications

(7 citation statements)

References 9 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Therefore, the 'supervised missing imputation' as been applied by means of the method 'Cmean'. This simple method has proved to be very effective, consists in imputing the mean for continuous variables, or the most repeated for categorical variables (Delavallade and Dang, 2007;Little and Rubin, 2002). In the supervised variant of 'Cmean', the imputed values are the mean of the values, in those cases that have the same recruitment level.…”

Section: Supervised Classification Based Methodologymentioning

confidence: 99%

Evaluating machine-learning techniques for recruitment forecasting of seven North East Atlantic fish species

Fernandes

Irigoien

Lozano

et al. 2015

Ecological Informatics

View full text Add to dashboard Cite

Section: Supervised Classification Based Methodologymentioning

confidence: 99%

Evaluating machine-learning techniques for recruitment forecasting of seven North East Atlantic fish species

Fernandes

Irigoien

Lozano

et al. 2015

Ecological Informatics

View full text Add to dashboard Cite

“…Delavallade and Dang [17] propose a new technique, based on the entropy measure, that finds a distribution value with more discrimination power for each missing value. Besides, they propose a new taxonomy for the methods, dividing them into: observation space or variable space, iterative or noniterative, local information or global information, stochastic or deterministic, prediction model or class information.…”

Section: Procedures Based On Direct Manipulation Of Missing Datamentioning

confidence: 99%

A brief review of the main approaches for treatment of missing data

Silva

Zárate

2014

IDA

View full text Add to dashboard Cite

Missing data is a significant problem found in data mining projects and data analysis. Despite being a common problem, the missing data is dealt in a simplistic way and may lead to inconsistent knowledge discovery. Through literature review, it was possible to observe that the missing data mechanisms are not always considered when methods of treatment or imputation are chosen. This work presents a review about the main treatment methods of missing data that can be considered in a process of knowledge discovery in database. We emphasize that attention should be given to the identification of the absence mechanism, for the choice the most appropriate treatment method.

show abstract

“…It consists of the imputation of missing data using complete objects in a small neighborhood of the incomplete ones. In [5], a new approach is proposed , using the entropy to estimate the missing values.…”

Section: Introductionmentioning

confidence: 99%

Some Imputation Algorithms for Restoration of Missing Data

Ryazanov

2011

Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications

View full text Add to dashboard Cite

Abstract. The problem of reconstructing the feature values in samples of objects given in terms of numerical features is considered. The three approaches, not involving the use of probability models and a priori information, are considered. The first approach is based on the organization of the iterative procedure for successive elaboration of missing values of attributes. In this case, the analysis of local information for each object with missing data is fulfilled. The second approach is based on solving an optimization problem. We calculate such previously unknown feature values for which there is maximum correspondence of metric relations between objects in subspaces of known partial values and found full descriptions. The third approach is based on solving a series of recognition tasks for each missing value. Comparisons of these approaches on simulated and real problems are presented.

show abstract

Using Entropy to Impute Missing Data in a Classification Task

Cited by 12 publications

References 9 publications

Evaluating machine-learning techniques for recruitment forecasting of seven North East Atlantic fish species

Evaluating machine-learning techniques for recruitment forecasting of seven North East Atlantic fish species

A brief review of the main approaches for treatment of missing data

Some Imputation Algorithms for Restoration of Missing Data

Contact Info

Product

Resources

About