2014
DOI: 10.4236/jdaip.2014.24012
|View full text |Cite
|
Sign up to set email alerts
|

A Feature Subset Selection Technique for High Dimensional Data Using Symmetric Uncertainty

Abstract: With the abundance of exceptionally High Dimensional data, feature selection has become an essential element in the Data Mining process. In this paper, we investigate the problem of efficient feature selection for classification on High Dimensional datasets. We present a novel filter based approach for feature selection that sorts out the features based on a score and then we measure the performance of four different Data Mining classification algorithms on the resulting data. In the proposed approach, we part… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
24
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 61 publications
(24 citation statements)
references
References 18 publications
0
24
0
Order By: Relevance
“…Symmetrical uncertainty (SU) is an information entropy-based filtering approach that measures the changes in entropy based on the concept of information entropy in order to assess the similarity or mutual information between the GCM and observed datasets [33,35]. The information entropy estimates the amount of information common between the two variables.…”
Section: Model Selection Using Symmetrical Uncertaintymentioning
confidence: 99%
See 1 more Smart Citation
“…Symmetrical uncertainty (SU) is an information entropy-based filtering approach that measures the changes in entropy based on the concept of information entropy in order to assess the similarity or mutual information between the GCM and observed datasets [33,35]. The information entropy estimates the amount of information common between the two variables.…”
Section: Model Selection Using Symmetrical Uncertaintymentioning
confidence: 99%
“…The information entropy estimates the amount of information common between the two variables. For example, if P(X) and P(Y) are the probability density functions and P(X, Y) is the joint probability density function of A and B, then the entropy H between X and Y is given in Equation (1) below [35,52]:…”
Section: Model Selection Using Symmetrical Uncertaintymentioning
confidence: 99%
“…F-correlation: the SU between any pair of features F i and F j (i = j), denoted by SU i,j . The use of symmetrical uncertainty has been proved to be useful in dimensionality reduction in previous studies [24][25][26][27][28][29].…”
Section: Symmetrical Uncertaintymentioning
confidence: 99%
“…To find the key factors that affect load patterns, the filter models [21] work the best among existing feature selection methods. Specifically, the entropy-based class measurement [22], which has been widely employed in filter models, is utilized in our study.…”
Section: A Socioeconomic Features Selectionmentioning
confidence: 99%