2020
DOI: 10.5281/zenodo.3607820
|View full text |Cite
|
Sign up to set email alerts
|

jameslyons/python_speech_features: release v0.6.1

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
6
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 16 publications
(6 citation statements)
references
References 0 publications
0
6
0
Order By: Relevance
“…where 𝑁 𝑓𝑏 is the total number of filters (usually 40) and 𝑁 𝑚𝑓𝑐𝑐 is the number of selected coefficients (usually 13). Among the most notable are Slaney's Auditory toolbox [31], Voicebox for MATLAB [32], and James Lyon's Python_speech_features GitHub resources [33]. In this study, we used Auditory Toolbox for MFCC and Mel filter bank computation.…”
Section: Human Speech Modelsmentioning
confidence: 99%
See 1 more Smart Citation
“…where 𝑁 𝑓𝑏 is the total number of filters (usually 40) and 𝑁 𝑚𝑓𝑐𝑐 is the number of selected coefficients (usually 13). Among the most notable are Slaney's Auditory toolbox [31], Voicebox for MATLAB [32], and James Lyon's Python_speech_features GitHub resources [33]. In this study, we used Auditory Toolbox for MFCC and Mel filter bank computation.…”
Section: Human Speech Modelsmentioning
confidence: 99%
“…Many libraries have been developed to extract the speech features for Mel, MFCC, PLP, LPC, and other filter banks. Among the most notable are Slaney's Auditory toolbox[31], Voicebox for MATLAB[32], and James Lyon's Python_speech_features GitHub resources[33]. In this study, we used Auditory Toolbox for MFCC and Mel filter bank computation.The human vocal tract can be simulated using formulas of air flow inside tubes with some simplifications.…”
mentioning
confidence: 99%
“…Many libraries have been developed to extract the speech features for Mel, MFCC, PLP, LPC, and other filter banks. Among the most notable are Slaney's Auditory toolbox [32], Voicebox for MATLAB [33], and James Lyon's Python_speech_features GitHub resources [34]. In this study, we used Auditory Toolbox for MFCC and Mel filter bank computation.…”
Section: Human Speech Modelsmentioning
confidence: 99%
“…From each audio recording, we extracted 13 melfrequency cepstral coefficients (MFCC 0-12) with a window length of 25 ms and step size of 10 ms using the python_speech_features library. 34 Mel-frequency cepstral coefficients (MFCCs) have been widely used in both speaker recognition, 35 SER, 36 and depression detection, 37 and have several desirable properties such as being independent of the energy of the acoustic signal and robustness across genders. 38,39 MFCCs represent movements of the vocal tract and are designed to mimic how the human ear perceives sounds by having high resolution in the lower frequencies and less in higher frequencies.…”
Section: Feature Extractionmentioning
confidence: 99%