2019
DOI: 10.3844/jcssp.2019.886.929
|View full text |Cite
|
Sign up to set email alerts
|

A Wide Scale Classification of Class Imbalance Problem and its Solutions: A Systematic Literature Review

Abstract: In today's world, most of the data (real world) is present in imbalanced form by nature. This is because of not having efficient algorithms to put this data (i.e., generated data by billion of internetconnected devices (IoTs)) in respective format. Imbalanced data poses a great challenge to (both) data mining and machine learning algorithms. The imbalanced dataset consists of a majority class and a minority class, where the majority class takes the lead over the minority class. Generally, several standard lear… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
4

Relationship

1
9

Authors

Journals

citations
Cited by 24 publications
(10 citation statements)
references
References 126 publications
0
7
0
Order By: Relevance
“…Therefore, when constructing the dataset, we thoroughly considered this factor and chose the method of sample size balance. Many approaches to overcoming the class imbalance problem have been proposed [57,58]. The most commonly used methods involve the implementation of various class balancing algorithms, oversampling (such as SMOTE) [59,60], undersampling, cost-sensitive learning [61,62], or ensemble methods [63][64][65] tailored for imbalanced datasets; these are used to solve the problem of the uneven data scale distribution of different dominant lithologies in the dataset.…”
Section: Discussion Of Research Limitationsmentioning
confidence: 99%
“…Therefore, when constructing the dataset, we thoroughly considered this factor and chose the method of sample size balance. Many approaches to overcoming the class imbalance problem have been proposed [57,58]. The most commonly used methods involve the implementation of various class balancing algorithms, oversampling (such as SMOTE) [59,60], undersampling, cost-sensitive learning [61,62], or ensemble methods [63][64][65] tailored for imbalanced datasets; these are used to solve the problem of the uneven data scale distribution of different dominant lithologies in the dataset.…”
Section: Discussion Of Research Limitationsmentioning
confidence: 99%
“…The most common restriction observed in our study is the insignificant quantity of data instances. Excluding the size of data, the quality of the dataset along with the cautious dimension reduction techniques [37,38] and data balancing [39,40] approaches play a significant role in effective cancer prediction results.…”
Section: Discussionmentioning
confidence: 99%
“…In ALT, traditional classification algorithms are modified to deal with unbalanced datasets either by modifying cost or weights. Finally, the ELT that combines the performances of multiple classifiers to make predictions [ 15 , 16 ]. This section briefly reviews a few recent works that deal with imbalanced datasets.…”
Section: Related Workmentioning
confidence: 99%