2018
DOI: 10.1016/j.neunet.2018.07.011
|View full text |Cite
|
Sign up to set email alerts
|

A systematic study of the class imbalance problem in convolutional neural networks

Abstract: In this study, we systematically investigate the impact of class imbalance on classification performance of convolutional neural networks (CNNs) and compare frequently used methods to address the issue. Class imbalance is a common problem that has been comprehensively studied in classical machine learning, yet very limited systematic research is available in the context of deep learning. In our study, we use three benchmark datasets of increasing complexity, MNIST, CIFAR-10 and ImageNet, to investigate the eff… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

15
1,201
2
4

Year Published

2018
2018
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 1,937 publications
(1,344 citation statements)
references
References 46 publications
15
1,201
2
4
Order By: Relevance
“…Norouzzadeh et al. () addressed this problem by placing higher weights on rare classes during model training but were not able to systematically improve accuracies for rare species (see Buda, Maki, and Mazurowski () for strategies to address class imbalance in modelling).…”
Section: Discussionmentioning
confidence: 99%
“…Norouzzadeh et al. () addressed this problem by placing higher weights on rare classes during model training but were not able to systematically improve accuracies for rare species (see Buda, Maki, and Mazurowski () for strategies to address class imbalance in modelling).…”
Section: Discussionmentioning
confidence: 99%
“…This suggests that the CNN is robust to the class imbalance of the ground‐truth data set, but a more balanced data set could potentially improve outcomes. Of note, the imbalance is relatively small, being only a factor of approximately 2.6, whereas in other domains in which deep learning is been applied the imbalance can be several orders of magnitude …”
Section: Discussionmentioning
confidence: 99%
“…The balanced training set did not get too much better result than the original training set. It is well known that training a network with an unbalanced dataset tends to harm those classes with the least number of examples and benefit those with the most [24], and it is not clear how the imbalanced attribute affects the experimental results. Sometimes balancing the training set improves the accuracy of the classes with fewer examples but harms the success rate for classes with more samples [7].…”
Section: Shape Classificationmentioning
confidence: 99%