2003
DOI: 10.1007/s00500-003-0310-2
|View full text |Cite
|
Sign up to set email alerts
|

Genetic algorithms for outlier detection and variable selection in linear regression models

Abstract: This article addresses some problems in outlier detection and variable selection in linear regression models. First, in outlier detection there are problems known as smearing and masking. Smearing means that one outlier makes another, non-outlier observation appear as an outlier, and masking that one outlier prevents another one from being detected. Detecting outliers one by one may therefore give misleading results. In this article a genetic algorithm is presented which considers different possible groupings … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
25
0
4

Year Published

2009
2009
2020
2020

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 67 publications
(30 citation statements)
references
References 18 publications
0
25
0
4
Order By: Relevance
“…Afterwards, the authors in [32] used genetic algorithms for detection of outliers and sample selection in linear regression models in the context of cross-section data. In [33], sample selection was carried out in the framework of Multi Objective Evolutionary Learning (MOEL) of Fuzzy Rule-Based Systems (FRBSs) by using a co-evolutionary method.…”
Section: Related Workmentioning
confidence: 99%
“…Afterwards, the authors in [32] used genetic algorithms for detection of outliers and sample selection in linear regression models in the context of cross-section data. In [33], sample selection was carried out in the framework of Multi Objective Evolutionary Learning (MOEL) of Fuzzy Rule-Based Systems (FRBSs) by using a co-evolutionary method.…”
Section: Related Workmentioning
confidence: 99%
“…On the other hand, there only very few approaches to instance selection for regression tasks can be found in the literature and in the the publications we were able to find the experiments were conducted only on artificial datasets, which were generated especially for the purpose of the experiments. Tolvi (2004) presented a genetic algorithm to perform feature and instance selection for linear regression models. In their works Guillen (2009) discussed the concept of mutual information used for selection of prototypes in regression problems.…”
Section: Instance Selectionmentioning
confidence: 99%
“…This has to be carried out by estimating, the mapping-function between inputs and outputs of the model. Such functions are usually highly non-linear and have to be computed using adaptive vector quantization (AVQ) (Tolvi (2004); Hongwei et al (2005)) based unidirectional temporary associative memory (UTAM) of neural-network, as depicted in figure 18.…”
Section: Understanding Of Awm and Vdmmentioning
confidence: 99%