2023
DOI: 10.3390/s23042032
|View full text |Cite
|
Sign up to set email alerts
|

Forest Sound Classification Dataset: FSC22

Abstract: The study of environmental sound classification (ESC) has become popular over the years due to the intricate nature of environmental sounds and the evolution of deep learning (DL) techniques. Forest ESC is one use case of ESC, which has been widely experimented with recently to identify illegal activities inside a forest. However, at present, there is a limitation of public datasets specific to all the possible sounds in a forest environment. Most of the existing experiments have been done using generic enviro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 54 publications
(82 reference statements)
0
3
0
Order By: Relevance
“…The Forest Sound Classification dataset (FSC22) [ 19 ] comprises 2025 labeled sound clips in a forest environment. Each audio clip is standardized to a length of 5 s, sampled at a rate of 44.1 kHz, and stored in the WAV file format.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The Forest Sound Classification dataset (FSC22) [ 19 ] comprises 2025 labeled sound clips in a forest environment. Each audio clip is standardized to a length of 5 s, sampled at a rate of 44.1 kHz, and stored in the WAV file format.…”
Section: Methodsmentioning
confidence: 99%
“…Henceforth, we conduct a comparative analysis of seven CNNs, namely ACDNet, AlexNet, ResNet-50, DenseNet-121, Inception-v3, MobileNet-v3-small, and EfficientNet-v2-B0 to exhibit the state-of-the-art [ 18 ]. The workflow involves the utilization of the FSC22 dataset [ 19 ], which is a dataset specifically created for forest sound data, subjected to preprocessing, followed by successive stages of data augmentation and feature extraction. Subsequently, the CNN models undergo training with k-fold cross-validation.…”
Section: Introductionmentioning
confidence: 99%
“…The refined dataset consisted of 1950 audio clips related to forest environments distributed across 26 classes, with each class comprising 75 audio clips. Each audio clip is 5 s long and sampled at a 44.1 kHz sampling rate [ 39 ].…”
Section: Methodsmentioning
confidence: 99%
“…However, these two classes of datasets cannot reflect the real forest acoustic environment with good quality. Recently, a forest sound classification dataset (FSC22) was built [7] containing five classes of sounds that possibly exist in a forest environment.…”
Section: Introductionmentioning
confidence: 99%