2018
DOI: 10.1016/j.patrec.2018.01.003
|View full text |Cite
|
Sign up to set email alerts
|

Oversampling imbalanced data in the string space

Abstract: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. Highlights• Oversampling in the string space for a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
21
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 46 publications
(21 citation statements)
references
References 33 publications
0
21
0
Order By: Relevance
“…Undersampling techniques such as random undersampling [4,5], which is the simplest sampling method, attempt to eliminate the harms of the imbalanced distribution by removing the intrinsic examples in the majority class. Contrary to undersampling, oversampling techniques aim to create new minority class examples [6][7][8][9][10]. For example, the random oversampling technique increases the number of examples of the minority class by randomly duplicating the minority class examples, and the synthetic minority oversampling technique (SMOTE) increases the ones by creating new synthetic examples along the line between the minority examples and their selected nearest neighbors [7,8].…”
Section: Introductionmentioning
confidence: 99%
“…Undersampling techniques such as random undersampling [4,5], which is the simplest sampling method, attempt to eliminate the harms of the imbalanced distribution by removing the intrinsic examples in the majority class. Contrary to undersampling, oversampling techniques aim to create new minority class examples [6][7][8][9][10]. For example, the random oversampling technique increases the number of examples of the minority class by randomly duplicating the minority class examples, and the synthetic minority oversampling technique (SMOTE) increases the ones by creating new synthetic examples along the line between the minority examples and their selected nearest neighbors [7,8].…”
Section: Introductionmentioning
confidence: 99%
“…And it can produce a better feature representation by unsupervised feature learning. In view of the second strategy, the data-based method [ 21 24 ] concentrates on generating new samples for minority class (oversampling) or removing samples from majority class (undersampling). Resampling techniques are often employed as a preprocessing process.…”
Section: Introductionmentioning
confidence: 99%
“…A lot of imbalanced learning algorithms have been developed over the past decade. To design algorithms that can deal with the class-imbalance problem, several approaches are widely adopted, such as the resampling approach (Zhu et al, 2017;Castellanos et al, 2018), the cost-sensitive approach (Cheng et al, 2016) and the ensemble approach (Sun et al, 2015;Lusa et al, 2016;Tang and He, 2017;Yuan et al, 2018). Most of imbalanced learning algorithms are designed to solve binary clas-In imbalanced learning, the class-imbalance extent is an important measurement to describe how imbalanced the data are (Ortigosa-Hernández et al, 2017).…”
Section: Introductionmentioning
confidence: 99%