2022
DOI: 10.1016/j.knosys.2022.108592
|View full text |Cite
|
Sign up to set email alerts
|

Resampling algorithms based on sample concatenation for imbalance learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 19 publications
(11 citation statements)
references
References 38 publications
0
7
0
Order By: Relevance
“…Unlike other clustering-based over-sampling methods, the proposed approach applies modified density peaks clustering rather than traditional k-means clustering techniques to cluster the minority instances due to its capability of accurately identifying sub-clusters with different sizes and densities, which is beneficial for the proposed method to simultaneously accommodate for between-class and within-class imbalance issues caused by various reasons. Shi Hongbo et al [ 15 ].proposed the Re-SC model. Re-SC transforms an imbalanced training dataset in the original sample space into a concatenated dataset in a new sample space.…”
Section: Related Workmentioning
confidence: 99%
“…Unlike other clustering-based over-sampling methods, the proposed approach applies modified density peaks clustering rather than traditional k-means clustering techniques to cluster the minority instances due to its capability of accurately identifying sub-clusters with different sizes and densities, which is beneficial for the proposed method to simultaneously accommodate for between-class and within-class imbalance issues caused by various reasons. Shi Hongbo et al [ 15 ].proposed the Re-SC model. Re-SC transforms an imbalanced training dataset in the original sample space into a concatenated dataset in a new sample space.…”
Section: Related Workmentioning
confidence: 99%
“…Some classification methods, such as Support Vector Machines (SVMs), apply kernel functions 1 to solve the linearity issue, yet, the lack of the data persists and penalizes classifiers' performances. Other studies, such as Shi et al (2022), used resampling algorithms based on sample concatenation (Re-SC). Re-SC transforms an imbalanced training dataset in the original sample space into a concatenated dataset in a new sample space.…”
Section: Class Overlapping and Within Class Imbalancementioning
confidence: 99%
“…𝑆 is inspired by the sample size formula in statistics (Cochran, 1977), in which πœ– is the acceptable tolerance error that can be adjusted as required, 𝑍 𝛼 is the critical value of the 𝑍 test at the significance level 𝛼 (Shi et al, 2022), πœ™ is an indicator function, 4 and 𝑝 is the variance of a proportion denoting the percentage of a sample having a particular characteristic, 5 and π‘˜ β€² is a hyper-parameter related to the size of clusters. Here, 𝑝 denotes the proportion of objects in the given cluster  𝑖 belonging to the minority class .…”
Section: Extraction and Classification Of Representative Pointsmentioning
confidence: 99%
See 2 more Smart Citations