2018
DOI: 10.15676/ijeei.2018.10.3.2
|View full text |Cite
|
Sign up to set email alerts
|

The Impact of Data Re-Sampling on Learning Performance of Class Imbalanced Bankruptcy Prediction Models

Abstract: The aim of this paper is to evaluate the effect of data sampling techniques on the performance of learners using real highly imbalanced Spanish bankruptcy dataset. The class imbalance problem refers to the highly uneven distribution of class instances where one class is having most of the instances than others. In the presence of highly skewed data distribution, the performance of classical learners is heavily biased in recognizing the majority class and consequently leads to the performance degradation of qua… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 10 publications
(10 citation statements)
references
References 34 publications
0
5
0
1
Order By: Relevance
“…Terdapat sejumlah teknik untuk menangani kumpulan kelas yang tidak seimbang, baik pada tingkat data maupun algoritma. Pada tingkat data, teknik yang diadopsi secara luas adalah resampling seperti oversampling [14,15] dan undersampling [16]. Teknik-teknik ini memodifikasi probabilitas sebelumnya dari kelas mayoritas dan minoritas dalam kumpulan data pelatihan untuk mendapatkan jumlah kasus yang lebih seimbang di setiap kelas.…”
Section: Pendahuluanunclassified
“…Terdapat sejumlah teknik untuk menangani kumpulan kelas yang tidak seimbang, baik pada tingkat data maupun algoritma. Pada tingkat data, teknik yang diadopsi secara luas adalah resampling seperti oversampling [14,15] dan undersampling [16]. Teknik-teknik ini memodifikasi probabilitas sebelumnya dari kelas mayoritas dan minoritas dalam kumpulan data pelatihan untuk mendapatkan jumlah kasus yang lebih seimbang di setiap kelas.…”
Section: Pendahuluanunclassified
“…The Synthetic Minority Oversampling Technique (SMOTE) [10] is an oversampling technique that adds new synthetic data to minority classes to balance them with the majority class sample. The parameters used are the percentage of minority classes that are exceeded, the total number of minority class data, and data parameters that state the value of the nearest neighbor of the minority class to the majority class.…”
Section: B Synthetic Minority Oversampling Technique (Smote)mentioning
confidence: 99%
“…Undersampling is also a technique for the process of resampling with imbalanced data. A portion of the majority class sample is removed to balance it with the minority sample [10]. Mathematically, the undersampling method can be explained through the below equation [9] :…”
Section: Undersamplingmentioning
confidence: 99%
See 1 more Smart Citation
“…A study proposed the variants of SMOTEBoost for imbalanced regression task and evaluated its performance using 30 datasets (Moniz et al, 2018). Studies have also evaluated the effect of different data sampling methods on learning performance of individual and ensemble models using highly skewed bankruptcy and credit card fraud datasets (Sisodia & Verma, 2018;Sisodia et al, 2018a). The above-discussed ensemble methods perform better on some skewed dataset with small class IR but show poor performance on datasets with high IR.…”
Section: Related Workmentioning
confidence: 99%