Sixth International Conference on Machine Learning and Applications (ICMLA 2007) 2007
DOI: 10.1109/icmla.2007.76
|View full text |Cite
|
Sign up to set email alerts
|

Learning with limited minority class data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
48
0
1

Year Published

2013
2013
2018
2018

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 83 publications
(49 citation statements)
references
References 11 publications
0
48
0
1
Order By: Relevance
“…obtaining the same cardinality in classes) is not optimal when dealing with such rare classes. For instance, the reader can consult the comprehensive study with many data sets and classifiers showing that depending on combination of data and classifiers the ratios of modified majority vs. minority class cardinalities like 3:1 and 2:1 quite often outperformed the most popular ratio 1:1 [33] Therefore, researchers proposed more elaborated methods that attempt at taking into account data characteristics and factors influencing nature of class imbalance.…”
Section: Improving Classifiers By Focused Re-sampling Methodsmentioning
confidence: 99%
“…obtaining the same cardinality in classes) is not optimal when dealing with such rare classes. For instance, the reader can consult the comprehensive study with many data sets and classifiers showing that depending on combination of data and classifiers the ratios of modified majority vs. minority class cardinalities like 3:1 and 2:1 quite often outperformed the most popular ratio 1:1 [33] Therefore, researchers proposed more elaborated methods that attempt at taking into account data characteristics and factors influencing nature of class imbalance.…”
Section: Improving Classifiers By Focused Re-sampling Methodsmentioning
confidence: 99%
“…Another feature of oversampling to consider is the class distribution ratio. Khoshgoftaar et al (2007) reported that an even distribution is not always optimal when dealing with data rarity. To ensure a more representative minority class, clusters could be identified in the minority class from which to sample the data.…”
Section: Discussionmentioning
confidence: 99%
“…To get reasonably balanced data for classification, we make the target ration for NFP and FP module as recommended to be 65% and 35%, respectively by Khoshgoftaar et al [10]. The DB is done on the selected features only.…”
Section: The Proposed Framework For Software Fault Predictionmentioning
confidence: 99%
“…Hence, they came up with their proposed over-sampling approach, in which the minority class is over-sampled by creating "synthetic" examples rather than by over-sampling with replacement. In conclusion to get reasonably balanced data for classification, we make the target ration for NFP and FP module as recommended to be 65% and 35%, respectively by [10].…”
Section: Data Balancingmentioning
confidence: 99%
See 1 more Smart Citation