2015
DOI: 10.1109/tkde.2014.2324567
|View full text |Cite
|
Sign up to set email alerts
|

RACOG and wRACOG: Two Probabilistic Oversampling Techniques

Abstract: As machine learning techniques mature and are used to tackle complex scientific problems, challenges arise such as the imbalanced class distribution problem, where one of the target class labels is under-represented in comparison with other classes. Existing oversampling approaches for addressing this problem typically do not consider the probability distribution of the minority class while synthetically generating new samples. As a result, the minority class is not well represented which leads to high misclas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
52
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 105 publications
(52 citation statements)
references
References 38 publications
(41 reference statements)
0
52
0
Order By: Relevance
“…Most of review paper and application work are based on the "classical" resampling techniques and do not take new resampling techniques into account. In this paper, we briefly review six powerful oversampling approaches, including both "classical" ones (SMOTE, ADASYN, MWMOTE) and new ones (RACOG, wRACOG, RWO-Sampling) [2,3,5,7,24]. The six reviewed oversampling techniques can be divided into two groups according to whether they consider the overall minority class distribution.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Most of review paper and application work are based on the "classical" resampling techniques and do not take new resampling techniques into account. In this paper, we briefly review six powerful oversampling approaches, including both "classical" ones (SMOTE, ADASYN, MWMOTE) and new ones (RACOG, wRACOG, RWO-Sampling) [2,3,5,7,24]. The six reviewed oversampling techniques can be divided into two groups according to whether they consider the overall minority class distribution.…”
Section: Related Workmentioning
confidence: 99%
“…Hence, the global information of the minority samples cannot be guaranteed. In order to tackle this problem, Das et al [5] proposed RACOG (RApidy COnverging Gibbs) and wRACOG (Wrapper-based RApidy COnverging Gibbs).…”
Section: Mwmotementioning
confidence: 99%
See 1 more Smart Citation
“…The first feature is the imbalanced distribution, which commonly exists in classification domains, such as spotting unreliable telecommunication customers, text classification, and detection of fraudulent telephone calls [26,27]. Certain solutions at the data and algorithmic levels are proposed for the classimbalance problem [28][29][30]. It is also a problem in sparse approximation.…”
Section: The Characteristic Of Industrial Data Setmentioning
confidence: 99%
“…These additional features are called latent features and the associated feature space as latent space. RACOG and wRACOG [24] both generate instances for the minority class by considering joint probability distribution of data features. The probability distribution of the minority class is learnt using a dependence tree algorithm and the Gibbs sampler is used to generate instances from the distribution.…”
Section: Introductionmentioning
confidence: 99%