2021
DOI: 10.19139/soic-2310-5070-1092
|View full text |Cite
|
Sign up to set email alerts
|

Feature Selection Based on Divergence Functions: A Comparative Classiffication Study

Abstract: Due to the extensive use of high-dimensional data and its application in a wide range of scientifc felds of research, dimensionality reduction has become a major part of the preprocessing step in machine learning. Feature selection is one procedure for reducing dimensionality. In this process, instead of using the whole set of features, a subset is selected to be used in the learning model. Feature selection (FS) methods are divided into three main categories: flters, wrappers, and embedded approaches. Filter … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 24 publications
(30 reference statements)
0
1
0
Order By: Relevance
“…The chi-square analysis is a method that is widely used to identify attributes that have the greatest bearing on accounting for a response variable subject to study. In Pourmand et al ( 2021 ), this method is explained as being used to select attributes based on the statistical distribution of data, although, in many applications, the probability distribution function is unknown and difficult to estimate (Berenfeld & Hoffmann, 2021 ). That is why a decision was made to use a tree-based iterative method—the CHAID analysis (Jojoa et al, 2021 )—complemented by a machine learning-based attributed selection, as the idea was to improve generalisation of the results obtained for these types of clinical issues in which attribute selection is of great importance.…”
Section: Discussionmentioning
confidence: 99%
“…The chi-square analysis is a method that is widely used to identify attributes that have the greatest bearing on accounting for a response variable subject to study. In Pourmand et al ( 2021 ), this method is explained as being used to select attributes based on the statistical distribution of data, although, in many applications, the probability distribution function is unknown and difficult to estimate (Berenfeld & Hoffmann, 2021 ). That is why a decision was made to use a tree-based iterative method—the CHAID analysis (Jojoa et al, 2021 )—complemented by a machine learning-based attributed selection, as the idea was to improve generalisation of the results obtained for these types of clinical issues in which attribute selection is of great importance.…”
Section: Discussionmentioning
confidence: 99%