2004
DOI: 10.1111/j.0824-7935.2004.t01-1-00228.x
|View full text |Cite
|
Sign up to set email alerts
|

A Multiple Resampling Method for Learning from Imbalanced Data Sets

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

3
378
0
9

Year Published

2006
2006
2022
2022

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 858 publications
(410 citation statements)
references
References 13 publications
3
378
0
9
Order By: Relevance
“…No oversampling techniques were examimed in this study. While it is known that for a few imbalanced datasets oversampling has performed satisfactorily (Japkowicz and Stephen, 2002;Estabrooks and Japkowicz, 2004), in many other cases undersampling proves to be superior to oversampling (Domingos, 1999;Drummond and Holte, 2003).…”
Section: Class Distribution and Classification Performancementioning
confidence: 99%
“…No oversampling techniques were examimed in this study. While it is known that for a few imbalanced datasets oversampling has performed satisfactorily (Japkowicz and Stephen, 2002;Estabrooks and Japkowicz, 2004), in many other cases undersampling proves to be superior to oversampling (Domingos, 1999;Drummond and Holte, 2003).…”
Section: Class Distribution and Classification Performancementioning
confidence: 99%
“…In general, it is unclear which approach is more effective and there have been attempts to combine them (Estabrooks et al, 2004). Another main approach is to attempt to modify the sensitivity of the classification algorithm so that errors on minority class to be costlier than errors on majority class (Veropoulos et al, 1999).…”
Section: Introductionmentioning
confidence: 99%
“…The problematic consequences thus are different. [23][24][25] Undersampling reduces the imbalanced ratio by randomly removing the majority examples and thus may lead to the loss of information about the majority class. Oversampling increases the size of the minority class by randomly duplicating the minority examples which may cause over fitting.…”
Section: Data Preprocessing Approachesmentioning
confidence: 99%