2021
DOI: 10.1016/j.asoc.2020.107033
|View full text |Cite
|
Sign up to set email alerts
|

EUSC: A clustering-based surrogate model to accelerate evolutionary undersampling in imbalanced classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
7
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 25 publications
(7 citation statements)
references
References 63 publications
0
7
0
Order By: Relevance
“…The data imbalance problem reduces the effectiveness of deep learning fault diagnosis [ 12 , 13 ]. To reduce the impact of imbalanced data on model performance of deep learning, data-level and algorithm-level approaches are proposed [ 14 , 15 , 16 , 17 , 18 ]. Among the data-level methods, oversampling and undersampling techniques are used to construct a balanced dataset [ 14 , 15 , 16 ].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The data imbalance problem reduces the effectiveness of deep learning fault diagnosis [ 12 , 13 ]. To reduce the impact of imbalanced data on model performance of deep learning, data-level and algorithm-level approaches are proposed [ 14 , 15 , 16 , 17 , 18 ]. Among the data-level methods, oversampling and undersampling techniques are used to construct a balanced dataset [ 14 , 15 , 16 ].…”
Section: Introductionmentioning
confidence: 99%
“…To reduce the impact of imbalanced data on model performance of deep learning, data-level and algorithm-level approaches are proposed [ 14 , 15 , 16 , 17 , 18 ]. Among the data-level methods, oversampling and undersampling techniques are used to construct a balanced dataset [ 14 , 15 , 16 ]. However, oversampling can produce duplicate information, while undersampling can lead to a loss of information.…”
Section: Introductionmentioning
confidence: 99%
“…The advantage of RUS is that it can quickly train the classification model, but it may eliminate useful data 15 . Another widely used undersampling method is based on clustering 16,17 . Cluster‐based methods preserve the data distribution characteristics while retaining the useful samples 16,18 .…”
Section: Introductionmentioning
confidence: 99%
“…15 Another widely used undersampling method is based on clustering. 16,17 Cluster-based methods preserve the data distribution characteristics F I G U R E 1 Data and its classification hyperplane while retaining the useful samples. 16,18 The cluster-based method divides the majority class into multiple groups, and then selects representative samples from every group.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, the interest in using EAs to address machine learning problems is growing fastly [19]- [29]. For imbalanced learning, EAs have been used for data sampling [30], [31] and cost-sensitive learning [32]. Although recent studies address the problem of determining the optimal misclassification costs [32], [33], they have paid little attention to considering the hyper-parameters of the learning algorithm, along with exploiting the hierarchical nature of parameter and hyper-parameter learning to guide search.…”
Section: Introductionmentioning
confidence: 99%