Forest Sound Classification Dataset: FSC22

Bandara, Meelan; Jayasundara, Roshinie; Ariyarathne, Isuru; Meedeniya, Dulani; Perera, Charith

doi:10.3390/s23042032

Cited by 8 publications

(4 citation statements)

References 54 publications

(82 reference statements)


“…The Forest Sound Classification dataset (FSC22) [19] comprises 2025 labeled sound clips in a forest environment. Each audio clip is standardized to a length of 5 s, sampled at a rate of 44.1 kHz, and stored in the WAV file format.…”

Section: Methods (mentioning)

confidence: 99%
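The clip format quoted above implies a fixed frame count per file: 44,100 frames per second times 5 s. The following is a minimal sketch of checking a WAV file against those two figures using Python's standard `wave` module. The function names are hypothetical, and the mono 16-bit test clip is an assumption, since the quote does not state channel count or bit depth.

```python
import io
import wave

SAMPLE_RATE_HZ = 44_100  # from the quoted description
CLIP_SECONDS = 5         # from the quoted description


def expected_frames() -> int:
    """Frames per channel in one clip: sampling rate times duration."""
    return SAMPLE_RATE_HZ * CLIP_SECONDS


def matches_quoted_format(wav_source) -> bool:
    """Check a WAV path or binary file object against the quoted rate and length."""
    with wave.open(wav_source, "rb") as wav:
        return (wav.getframerate() == SAMPLE_RATE_HZ
                and wav.getnframes() == expected_frames())


def make_silent_clip() -> io.BytesIO:
    """Hypothetical stand-in for a dataset file: 5 s of mono 16-bit silence."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)   # assumption: channel count is not given in the quote
        wav.setsampwidth(2)   # assumption: 16-bit samples
        wav.setframerate(SAMPLE_RATE_HZ)
        wav.writeframes(b"\x00\x00" * expected_frames())
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    print(expected_frames())
    print(matches_quoted_format(make_silent_clip()))
```

In practice the same check could be run over each file path in a local copy of the dataset to flag clips that deviate from the stated format.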

“…Henceforth, we conduct a comparative analysis of seven CNNs, namely ACDNet, AlexNet, ResNet-50, DenseNet-121, Inception-v3, MobileNet-v3-small, and EfficientNet-v2-B0 to exhibit the state-of-the-art [18]. The workflow involves the utilization of the FSC22 dataset [19], which is a dataset specifically created for forest sound data, subjected to preprocessing, followed by successive stages of data augmentation and feature extraction. Subsequently, the CNN models undergo training with k-fold cross-validation.…”

Section: Introduction (mentioning)

confidence: 99%


A Comparative Study of Preprocessing and Model Compression Techniques in Deep Learning for Forest Sound Classification

Paranayapa, Ranasinghe, Ranmal et al. 2024. Sensors. (Self-citation)

Deep-learning models play a significant role in modern software solutions, with the capabilities of handling complex tasks, improving accuracy, automating processes, and adapting to diverse domains, eventually contributing to advancements in various industries. This study provides a comparative study on deep-learning techniques that can also be deployed on resource-constrained edge devices. As a novel contribution, we analyze the performance of seven Convolutional Neural Network models in the context of data augmentation, feature extraction, and model compression using acoustic data. The results show that the best performers can achieve an optimal trade-off between model accuracy and size when compressed with weight and filter pruning followed by 8-bit quantization. In adherence to the study workflow utilizing the forest sound dataset, MobileNet-v3-small and ACDNet achieved accuracies of 87.95% and 85.64%, respectively, while maintaining compact sizes of 243 KB and 484 KB, respectively. Henceforth, this study concludes that CNNs can be optimized and compressed to be deployed in resource-constrained edge devices for classifying forest environment sounds.


“…The refined dataset consisted of 1950 audio clips related to forest environments distributed across 26 classes, with each class comprising 75 audio clips. Each audio clip is 5 s long and sampled at a 44.1 kHz sampling rate [39].…”

Section: Methods (mentioning)

confidence: 99%
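The balanced layout quoted above (26 classes of 75 clips each, 1950 in total) splits evenly under stratified k-fold cross-validation whenever k divides 75. The following is a minimal sketch of such a split, assuming k = 5 and integer class labels; neither is stated in the quote, and `stratified_folds` is a hypothetical helper, not the cited authors' procedure.

```python
from collections import Counter

NUM_CLASSES = 26      # from the quoted description
CLIPS_PER_CLASS = 75  # from the quoted description


def stratified_folds(labels, k):
    """Assign each item to a fold, cycling through folds within each class."""
    seen = Counter()
    folds = []
    for label in labels:
        folds.append(seen[label] % k)  # round-robin per class keeps folds balanced
        seen[label] += 1
    return folds


# Hypothetical label list: class ids 0..25, each repeated 75 times.
labels = [c for c in range(NUM_CLASSES) for _ in range(CLIPS_PER_CLASS)]
folds = stratified_folds(labels, k=5)  # assumption: k = 5

if __name__ == "__main__":
    print(len(labels))                    # total number of clips
    print(sorted(Counter(folds).items())) # clips per fold
```

With k = 5, each fold receives 15 clips from every class, so each held-out fold preserves the class balance of the full set.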

ESC-NAS: Environment Sound Classification Using Hardware-Aware Neural Architecture Search for the Edge

Ranmal, Ranasinghe, Paranayapa et al. 2024. Sensors. (Self-citation)

The combination of deep-learning and IoT plays a significant role in modern smart solutions, providing the capability of handling task-specific real-time offline operations with improved accuracy and minimised resource consumption. This study provides a novel hardware-aware neural architecture search approach called ESC-NAS, to design and develop deep convolutional neural network architectures specifically tailored for handling raw audio inputs in environmental sound classification applications under limited computational resources. The ESC-NAS process consists of a novel cell-based neural architecture search space built with 2D convolution, batch normalization, and max pooling layers, and capable of extracting features from raw audio. A black-box Bayesian optimization search strategy explores the search space and the resulting model architectures are evaluated through hardware simulation. The models obtained from the ESC-NAS process achieved the optimal trade-off between model performance and resource consumption compared to the existing literature. The ESC-NAS models achieved accuracies of 85.78%, 81.25%, 96.25%, and 81.0% for the FSC22, UrbanSound8K, ESC-10, and ESC-50 datasets, respectively, with optimal model sizes and parameter counts for edge deployment.


“…However, these two classes of datasets cannot reflect the real forest acoustic environment with good quality. Recently, a forest sound classification dataset (FSC22) was built [7] containing five classes of sounds that possibly exist in a forest environment.…”

Section: Introduction (mentioning)

confidence: 99%

Sound classification with time-frequency features in forest environment

Xu, Chen 2024. J. Phys.: Conf. Ser.

The study of forest sound classification has drawn more attention recently due to its potential for illegal activities and natural disaster monitoring. Based on the forest sound classification dataset (FSC22), a dataset specific to possible sound existing in the forest, five classification methods are utilized to investigate the relationship between recognition accuracy and the number of sound acoustic features, as well as the number of target classes. The results confirmed that extreme random forest is the best method for forest sound classification, with an accuracy of around 70% when the target class number is above 20. Further, Mel-frequency cepstral coefficients are the critical feature for sound classification, while fuzzy labels in the dataset may reduce the success rate of recognition.
