2021
DOI: 10.1016/j.ins.2021.02.056
|View full text |Cite
|
Sign up to set email alerts
|

A cluster-based oversampling algorithm combining SMOTE and k-means for imbalanced medical data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
27
0
4

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 113 publications
(31 citation statements)
references
References 33 publications
0
27
0
4
Order By: Relevance
“…The SMOTEC algorithm [ 11 ] first uses the modified SMOTE (synthetic minority oversampling technique) method to oversample a small number of class instances in the training dataset to increase the number of minority class samples. Then it uses the SVM feature to design a clustering algorithm to clean the data set after oversampling.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…The SMOTEC algorithm [ 11 ] first uses the modified SMOTE (synthetic minority oversampling technique) method to oversample a small number of class instances in the training dataset to increase the number of minority class samples. Then it uses the SVM feature to design a clustering algorithm to clean the data set after oversampling.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…The weights were adaptively determined according to the number of majority data around minority ones. Xu et al 32 introduced K ‐means clustering to SMOTE, with the purpose of distinguishing noise, overlapping, and boundary samples. Krawczyk et al 33 proposed multiclass radial‐based oversampling to learn the data distribution of minority samples by radial basis function (RBF).…”
Section: Related Workmentioning
confidence: 99%
“…Traditional AI classifiers are vulnerable in learning highly skewed data as they are designed to expect different classes to contribute equally to minimization of the classifiers' loss functions [34]. One of the well-known oversampling techniques called Synthetic Minority Over-sampling Technique (SMOTE) was proposed in [35] which formed the basis of many other oversampling techniques for classification such as hybrid K-means SMOTE [36] and SMOTE combined with self-organizing maps [34]. The main reason which hinders the application of oversampling for regression problems, is the determination of the target output of the oversampled datapoints.…”
Section: ) Refinement Of the Ann Architecturementioning
confidence: 99%