2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE) 2020
DOI: 10.1109/icmcce51767.2020.00334
|View full text |Cite
|
Sign up to set email alerts
|

Birdsong Recognition Based on MFCC combined with Vocal Tract Properties

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
0
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 8 publications
0
0
0
Order By: Relevance
“…About the application of MFCC, applying the MFCC to find elements of bird song is one example of how it is frequently used to extract features in speech recognition systems. [17] Since normalizing the values reduces the impact of noise in speech recognition systems, MFCC also has the drawback of having values that are not very stable in the face of additive noise.…”
Section: Feature Extractionmentioning
confidence: 99%
“…About the application of MFCC, applying the MFCC to find elements of bird song is one example of how it is frequently used to extract features in speech recognition systems. [17] Since normalizing the values reduces the impact of noise in speech recognition systems, MFCC also has the drawback of having values that are not very stable in the face of additive noise.…”
Section: Feature Extractionmentioning
confidence: 99%
“…The data curation mainly includes the following steps: (1) original audio preprocessing, which includes resampling, conversion of the original audio format, audio truncation, bird song detection, audio merging, spectrogram generation, accurate data annotation, and data partitioning for training, verification and evaluation; and (2) feature extraction. Using Librosa (https://librosa.org/ (accessed on 20 May 2023)) to extract features and mixing the Mel spectrum and Mel frequency cepstrum coefficient (MFCC) [20] with dynamic features to enhance the effectiveness of bird song features (as shown in Figure 1). Subsequently, the Librosa library is used for the feature extraction of the Mel spectrum and MFCC.…”
Section: Datasetmentioning
confidence: 99%