2021
DOI: 10.1088/1742-6596/1918/4/042153
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of feature selection using information gain and gain ratio on bank marketing classification using naïve bayes

Abstract: One of the efforts of banks to do marketing is by telephone to offer their products, such as deposits. There are many variables that influence whether the customer decides to subscribe or not. In this study, we present a comparison of feature selection from high features dataset. We use a bank marketing dataset which has 20 features and consists of 4,119 instances. We consider 2 ranking methods entropy-based, namely Information Gain (IG) and Gain Ratio (GR). In our experiment, we classified the various selecte… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 8 publications
0
5
0
Order By: Relevance
“…The Information Gain prefers attributes with more possible values, even though features with fewer values are more informative. The gain ratio (GR), an addition to Information Gain, is used by C4.5, a method that replaces the ID3 algorithm to combat bias [21]. By using the intrinsic information (Entropy) of that characteristic, it corrects Information Gain.…”
Section: )mentioning
confidence: 99%
“…The Information Gain prefers attributes with more possible values, even though features with fewer values are more informative. The gain ratio (GR), an addition to Information Gain, is used by C4.5, a method that replaces the ID3 algorithm to combat bias [21]. By using the intrinsic information (Entropy) of that characteristic, it corrects Information Gain.…”
Section: )mentioning
confidence: 99%
“…In the feature selection process of gain ratio, improvement is achieved by considering intrinsic information of attributes. [5], [7]. In a research conducted by [8] examining the utilization of SVM, IG and IGR, an accuracy of 0.662 and AUC of 0.533 was achieved with IG, an accuracy of 0.691 and AUC of 0.584 with IGR.…”
Section: Who Declared Coronavirus Disease 2019 (Covid-19) a Global Pa...mentioning
confidence: 99%
“…Subsequently, particle velocities are updated using equation ( 6 ), which considers correction factors 𝑐 1 and 𝑐 2 along with random variables 𝑟 1 and 𝑟 2 . Alongside this velocity update, the position of each particle is modified according to equation ( 7 ), ensuring a dynamic exploration of the solution space. These integral steps collectively define the systematic operation of the PSO algorithm in its quest for optimized solutions [25].…”
Section: F Particle Swarm Optimization (Pso)mentioning
confidence: 99%
“…Feature selection gain ratio is used to reduce the dimensionality of the collected data, effectively shortening the process's time [17]. Selected features on Gain Ratio can affect accuracy performance [18].Therefore, this research uses Gain Ratio (GR) as a feature selection to optimize the LVQ 1 algorithm and see the accuracy value obtained using and not using Gain Ratio (GR) feature selection.…”
Section: Introductionmentioning
confidence: 99%