2014
DOI: 10.32614/rj-2014-008
|View full text |Cite
|
Sign up to set email alerts
|

ROSE: a Package for Binary Imbalanced Learning

Abstract: The ROSE package provides functions to deal with binary classification problems in the presence of imbalanced classes. Artificial balanced samples are generated according to a smoothed bootstrap approach and allow for aiding both the phases of estimation and accuracy evaluation of a binary classifier in the presence of a rare class. Functions that implement more traditional remedies for the class imbalance and different metrics to evaluate accuracy are also provided. These are estimated by holdout, bootstrap, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
210
0
5

Year Published

2017
2017
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 385 publications
(240 citation statements)
references
References 4 publications
1
210
0
5
Order By: Relevance
“…Further details on how to select the smoothing matrix Hj can be found in Menardi and Torelli () and Lunardon et al . ().…”
Section: Simulation Based On the Taiwan Credit Card Data Setsupporting
confidence: 92%
See 2 more Smart Citations
“…Further details on how to select the smoothing matrix Hj can be found in Menardi and Torelli () and Lunardon et al . ().…”
Section: Simulation Based On the Taiwan Credit Card Data Setsupporting
confidence: 92%
“…In particular, we ‘balanced’ the data by using random oversampling examples (ROSE, by Menardi and Torelli () and Lunardon et al . ()) and the synthetic minority oversampling technique SMOTE (by Chawla et al . ()) methods for various EPV‐values and develop prediction models by using logistic regression.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In order to address the problem of class imbalance, two remedial measures will be applied in the case studies. More specifically, SMOTE and ROSE sampling (Lunardon et al, 2014) can be selected to create a more balanced training dataset. In short, SMOTE draws artificial samples by choosing points that lie on the line connecting the rare observation to one of its nearest neighbours in the feature space.…”
Section: Intrinsic Properties Of Machine Learning Techniquesmentioning
confidence: 99%
“…We aimed to collect about 25 cases of interest within each participant. Since this data was still comprised of substantially more non-lapse than lapse cases (unbalanced data), non-lapse cases were randomly sampled in a 1:1 ratio to lapse cases via the ROSE package in the R statistical computing software (e.g., see [49, 50]). This 1:1 sampling resulted in oversampling of the lapse-cases and undersampling of the non-lapse cases in order to create a balanced data set, which is suggested for optimal classification accuracy in these types of models [51].…”
Section: Problems and Future Considerationsmentioning
confidence: 99%