2018
DOI: 10.3844/jcssp.2018.777.792
|View full text |Cite
|
Sign up to set email alerts
|

An Insight into Rare Class Problem: Analysis and Potential Solutions

Abstract: The class imbalance problem presents an important challenge to the data mining community, in which the number of examples of one class is more than the others. This problem is characterized by a different distribution of cases between all the classes. In this paper, our goal is to study the various challenges of class imbalance problem and provide a comparative study of the current development of research in learning from imbalanced data. We provide a thorough understanding of the nature of the problem, the me… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 53 publications
0
5
0
Order By: Relevance
“…However, it can be readily observed from Figure 2 that the "Red" label heavily dominates the sampled dataset. This so-called "class imbalance problem" has significant adverse effects on any machine learning algorithm [23][24][25][26]. It leads the model to be skewed towards the majority class, creating bias and rendering the algorithm unable to adapt to the features of the minority classes [23,24].…”
Section: Data Preprocessingmentioning
confidence: 99%
“…However, it can be readily observed from Figure 2 that the "Red" label heavily dominates the sampled dataset. This so-called "class imbalance problem" has significant adverse effects on any machine learning algorithm [23][24][25][26]. It leads the model to be skewed towards the majority class, creating bias and rendering the algorithm unable to adapt to the features of the minority classes [23,24].…”
Section: Data Preprocessingmentioning
confidence: 99%
“…From these, in 2,525,847 cases, the excitation resulted in the overturning of the block, while in 3,256,553 it did not. Thus, the class ratio was approximately 56:44, i.e., the dataset did not suffer from the so-called class imbalance problem, which can severely impact the performance of ML algorithms [53,54]. Finally, the maximum seismic rocking rotation was measured, but only for those instances where the block did not overturn.…”
Section: Dataset Descriptionmentioning
confidence: 99%
“…The dataset collected from the commercial bank of Ethiopia was target class imbalanced. According to , Burez et al, (2009) and Maheshwari et al, (2017), the collected dataset must be balanced before training to the proposed model. The synthetic minority oversampling technique (SMOTE), works by nominating related records from the smaller class and altering them one column at a time by a random amount to balance the data.…”
Section: Class Imbalancementioning
confidence: 99%