2023
DOI: 10.1007/s10489-023-04486-8
|View full text |Cite
|
Sign up to set email alerts
|

Identifying bird species by their calls in Soundscapes

Abstract: In many real data science problems, it is common to encounter a domain mismatch between the training and testing datasets, which means that solutions designed for one may not transfer well to the other due to their differences. An example of such was in the BirdCLEF2021 Kaggle competition, where participants had to identify all bird species that could be heard in audio recordings. Thus, multi-label classifiers, capable of coping with domain mismatch, were required. In addition, classifiers needed to be resilie… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 28 publications
0
1
0
Order By: Relevance
“…As many competitions were launched for identifying bird voices, hence, many approaches were proposed for classi cation. The most common approach is using CNN [1] [2]. Some of the approaches use machine learning [3], some use neural networks [4] while some combine machine learning and neural networks like combination of ANN(Arti cial Neural Network) and SVM [5] [6], combination of ANN and KNC(K-Nearest Centroid) where SVM and KNC are used for processing audio signals before feeding them into network.…”
Section: Methodsmentioning
confidence: 99%
“…As many competitions were launched for identifying bird voices, hence, many approaches were proposed for classi cation. The most common approach is using CNN [1] [2]. Some of the approaches use machine learning [3], some use neural networks [4] while some combine machine learning and neural networks like combination of ANN(Arti cial Neural Network) and SVM [5] [6], combination of ANN and KNC(K-Nearest Centroid) where SVM and KNC are used for processing audio signals before feeding them into network.…”
Section: Methodsmentioning
confidence: 99%