2021
DOI: 10.1186/s13040-021-00283-6
|View full text |Cite
|
Sign up to set email alerts
|

Gaussian noise up-sampling is better suited than SMOTE and ADASYN for clinical decision making

Abstract: Clinical data sets have very special properties and suffer from many caveats in machine learning. They typically show a high-class imbalance, have a small number of samples and a large number of parameters, and have missing values. While feature selection approaches and imputation techniques address the former problems, the class imbalance is typically addressed using augmentation techniques. However, these techniques have been developed for big data analytics, and their suitability for clinical data sets is u… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 34 publications
(15 citation statements)
references
References 34 publications
0
15
0
Order By: Relevance
“…Upsampled patients were selected randomly from each profile and across features Gaussian noise was added to the observations (+/- 1 SD ). Upsampling with Gaussian noise was shown to be especially suitable for clinical data sets ( 45 ). As a result, no class imbalance was present for multi-class and OVO.…”
Section: Methodsmentioning
confidence: 99%
“…Upsampled patients were selected randomly from each profile and across features Gaussian noise was added to the observations (+/- 1 SD ). Upsampling with Gaussian noise was shown to be especially suitable for clinical data sets ( 45 ). As a result, no class imbalance was present for multi-class and OVO.…”
Section: Methodsmentioning
confidence: 99%
“…Adding white or coloured noise to a weak signal can sometimes, paradoxically, increase its detectability through the process of stochastic resonance [127]. It can also be used as an up-sampling method of data augmentation to enhance classification in machine learning [128]. Different coloured and white noises can now be added to signals in CEPS, or used as stand-alone data to test how noises affect the different measures.…”
Section: Adding Noise ('Add Noise')mentioning
confidence: 99%
“…The ADASYN algorithm is an improvement of SMOTE algorithm, which was first proposed by Haibo He in 2008. [ 13 ] Based on the data distribution, it can make an adaptive decision on the number of minority samples while minimizing the learning bias, and pay special attention to difficult‐to‐learn samples. The steps of new samples generation are as follows: Step 1: Calculate the imbalance ratio of sample category: d=mnormalg/mnormalp …”
Section: Preliminariesmentioning
confidence: 99%