2016 International Conference on Communication and Signal Processing (ICCSP) 2016
DOI: 10.1109/iccsp.2016.7754521
|View full text |Cite
|
Sign up to set email alerts
|

A new approach for handling imbalanced dataset using ANN and genetic algorithm

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0
2

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(10 citation statements)
references
References 3 publications
0
6
0
2
Order By: Relevance
“…As observed from the class feature distribution plot in figure 6.5, this dataset is highly imbalanced. Synthetic Minority Oversampling Technique (SMOTE) is employed to tackle this problem (Sonak et al, 2016) There was one feature (cd_000) that had a single value for all data (standard deviation = 0). Since it will not add much value to our model performance, it can be removed.…”
Section: Feature Engineeringmentioning
confidence: 99%
“…As observed from the class feature distribution plot in figure 6.5, this dataset is highly imbalanced. Synthetic Minority Oversampling Technique (SMOTE) is employed to tackle this problem (Sonak et al, 2016) There was one feature (cd_000) that had a single value for all data (standard deviation = 0). Since it will not add much value to our model performance, it can be removed.…”
Section: Feature Engineeringmentioning
confidence: 99%
“…Secara teknis, kumpulan suatu data dikatakan tidak seimbang jika distribusi antar kelas di dalam dataset tidak merata atau seragam. Dalam kondisi seperti ini satu kelas dataset yang digambarkan hanya oleh sejumlah kecil contoh atau kelas minoritas dan kelas lain membentuk sebagian besar data atau kelas mayoritas [13]. Akibatnya, sering hasil prediksi cenderung kepada dataset yang mayoritas.…”
Section: Pendahuluanunclassified
“…Pertama adalah dengan cara pendekatan level data (Sampling) yaitu pengambilan ulang sampel data untuk mengubah data yang tidak seimbang menjadi seimbang. Pendekatan level data (Sampling) dapat digunakan untuk modifikasi distribusi kelas dari data latih untuk menyeimbangkan data [13], pendekatan level data itu sendiri adalah tahapan preprocessing yang dilakukan sebelum membuat pemodelan machine learning [12]. Pendekatan kedua adalah dengan cara penyesuaian Cost Sensitive pada data aslinya [13], Cost Sensitive merupakan pembelajaran machine learning dalam mempertimbangkan kesalahan klasifikasi [13].…”
Section: Pendahuluanunclassified
“…Class imbalance problems have gained significant attention in the ML community recently [32,33]. Kotsiantis et al [34] used different tools and techniques to handle class imbalance and Sonak et al [35] analyzed several different methods of class imbalance problems. Classification becomes more cumbersome as data size increases, due to unbounded and unbalanced data quality.…”
Section: Background and Related Workmentioning
confidence: 99%