A systematic study of the class imbalance problem in convolutional neural networks

Buda, Mateusz; Maki, Atsuto; Mazurowski, Maciej A.

doi:10.1016/j.neunet.2018.07.011

Cited by 1,937 publications

(1,344 citation statements)

References 46 publications

Mentioning: 1,201

“…Norouzzadeh et al. () addressed this problem by placing higher weights on rare classes during model training but were not able to systematically improve accuracies for rare species (see Buda, Maki, and Mazurowski () for strategies to address class imbalance in modelling).…”

Section: Discussion (mentioning)

confidence: 99%

Identifying animal species in camera trap images using deep learning and citizen science

et al. 2018

Ecologists often study wildlife populations by deploying camera traps. Large datasets are generated using this approach which can be difficult for research teams to manually evaluate. Researchers increasingly enlist volunteers from the general public as citizen scientists to help classify images. The growing number of camera trap studies, however, makes it ever more challenging to find enough volunteers to process all projects in a timely manner. Advances in machine learning, especially deep learning, allow for accurate automatic image classification. By training models using existing datasets of images classified by citizen scientists and subsequent application of such models on new studies, human effort may be reduced substantially. The goals of this study were to (a) assess the accuracy of deep learning in classifying camera trap data, (b) investigate how to process datasets with only a few classified images that are generally difficult to model, and (c) apply a trained model on a live online citizen science project. Convolutional neural networks (CNNs) were used to differentiate among images of different animal species, images of humans or vehicles, and empty images (no animals, vehicles, or humans). We used four different camera trap datasets featuring a wide variety of species, different habitats, and a varying number of images. All datasets were labelled by citizen scientists on Zooniverse. Accuracies for identifying empty images across projects ranged between 91.2% and 98.0%, whereas accuracies for identifying specific species were between 88.7% and 92.7%. Transferring information from CNNs trained on large datasets (“transfer‐learning”) was increasingly beneficial as the size of the training dataset decreased and raised accuracy by up to 10.3%. Removing low‐confidence predictions increased model accuracies to the level of citizen scientists. 
By combining a trained model with classifications from citizen scientists, human effort was reduced by 43% while maintaining overall accuracy for a live experiment running on Zooniverse. Ecology researchers can significantly reduce image classification time and manual effort by combining citizen scientists and CNNs, enabling faster processing of data from large camera trap studies.

“…This suggests that the CNN is robust to the class imbalance of the ground‐truth data set, but a more balanced data set could potentially improve outcomes. Of note, the imbalance is relatively small, being only a factor of approximately 2.6, whereas in other domains in which deep learning has been applied the imbalance can be several orders of magnitude …”

Section: Discussion (mentioning)

confidence: 99%

A convolutional neural network to filter artifacts in spectroscopic MRI

Gurbani, Schreibmann, Maudsley, et al. 2018

Magnetic Resonance in Med

“…The balanced training set did not get too much better result than the original training set. It is well known that training a network with an unbalanced dataset tends to harm those classes with the least number of examples and benefit those with the most [24], and it is not clear how the imbalanced attribute affects the experimental results. Sometimes balancing the training set improves the accuracy of the classes with fewer examples but harms the success rate for classes with more samples [7].…”

Section: Shape Classification (mentioning)

confidence: 99%

An Improved 3D Shape Recognition Method Based on Panoramic View

Zheng, Sun, Zhang, et al. 2018

Mathematical Problems in Engineering

Recognition of three-dimensional (3D) shape is a remarkable subject in computer vision systems, because of the lack of excellent shape representations. With the development of 2.5D depth sensors, shape recognition is becoming more important in practical applications. Many methods have been proposed to preprocess 3D shapes, in order to get available input data. A common approach employs convolutional neural networks (CNNs), which have become a powerful tool to solve many problems in the field of computer vision. DeepPano, a variant of CNN, converts each 3D shape into a panoramic view and shows excellent performance. It is worth paying attention to the fact that both serious information loss and redundancy exist in the processing of DeepPano, which limits further improvement of its performance. In this work, we propose a more effective method to preprocess 3D shapes also based on a panoramic view, similar to DeepPano. We introduce a novel method to expand the training set and optimize the architecture of the network. The experimental results show that our approach outperforms DeepPano and can deal with more complex 3D shape recognition problems with a higher diversity of target orientation.