2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS) 2016
DOI: 10.1109/icsess.2016.7883053
|View full text |Cite
|
Sign up to set email alerts
|

Variable selection using Mean Decrease Accuracy and Mean Decrease Gini based on Random Forest

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
59
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 141 publications
(83 citation statements)
references
References 2 publications
2
59
0
Order By: Relevance
“…In studies characterized by very high dimensionality, it is crucial to estimate the importance of each predictive variable in classifying the data in order to determine the variables performance. In order to detect the predictive power of the input variables within the RF algorithm, the Mean Decrease in Accuracy (MDA) and Mean Decreased Gini (MDG) for individual vegetation indices across all RF trees was analysed (Han et al, 2016). The Gini index is a measure of the homogeneity and purity of nodes and leaves.…”
Section: Validation Classification Accuracy Assessment and Comparisomentioning
confidence: 99%
“…In studies characterized by very high dimensionality, it is crucial to estimate the importance of each predictive variable in classifying the data in order to determine the variables performance. In order to detect the predictive power of the input variables within the RF algorithm, the Mean Decrease in Accuracy (MDA) and Mean Decreased Gini (MDG) for individual vegetation indices across all RF trees was analysed (Han et al, 2016). The Gini index is a measure of the homogeneity and purity of nodes and leaves.…”
Section: Validation Classification Accuracy Assessment and Comparisomentioning
confidence: 99%
“…The Gini impurity metric accounts for how often a randomly selected component from a training set would be incorrectly labelled if it was randomly labelled according to the class distribution in a subset [61]. The mean decrease in the Gini, also known as Gini importance, is the total decrease in node impurities from splitting on the variable, averaged over all trees [62]. This is essentially a measure of how important a variable is for estimating the value of the target variable across all tress in the forest.…”
Section: Random Forestmentioning
confidence: 99%
“…To assess the relative importance of all available input variables (Table 2) aggregated to the ground as in Sect. 3.3 and to choose the most pertinent ones, an approach from Han et al (2016) has been adapted for regression. Assuming as before that M is the number of available input features, the method is described in Algorithm 1.…”
Section: Choice Of Input Featuresmentioning
confidence: 99%