Intelligent Control and Automation
DOI: 10.1007/978-3-540-37256-1_89
|View full text |Cite
|
Sign up to set email alerts
|

Under-Sampling Approaches for Improving Prediction of the Minority Class in an Imbalanced Dataset

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
25
0

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 104 publications
(25 citation statements)
references
References 6 publications
0
25
0
Order By: Relevance
“…Alternative approaches that could help balancing datasets include undersampling of the bigger class [32,33] or oversampling of the smaller class. [34] The extended discussion of these approaches is beyond the scope of this review.…”
Section: Dataset Size and Balancingmentioning
confidence: 99%
“…Alternative approaches that could help balancing datasets include undersampling of the bigger class [32,33] or oversampling of the smaller class. [34] The extended discussion of these approaches is beyond the scope of this review.…”
Section: Dataset Size and Balancingmentioning
confidence: 99%
“…Besides, if a positive instance is misclassified by its three nearest neighbors, then the algorithm also eliminates the neighbors that belong to the majority class. A quite different alternative corresponds to under-Sampling Based on Clustering (SBC) [22], which rests on the idea that there may exist different clusters in a given data set, and each cluster may have distinct characteristics depending on the ratio of the number of positive examples to the number of negative examples in the cluster. Thus the SBC algorithm first gathers all examples in the data set into some clusters, and then determines the number of negative cases that will be randomly picked up.…”
Section: Under-samplingmentioning
confidence: 99%
“…This occurs when there are many more instances in one class than others in a training collection. In this case, a classifier usually tends to predict samples from the majority class and completely ignore the minority class [37]. For this reason, we applied an approach to subsampling (reducing the number of samples that have the majority class) to generate a balanced collection composed of:…”
Section: Features Selection Algorithms Studymentioning
confidence: 99%