From Natural to Artificial Intelligence - Algorithms and Applications 2018
DOI: 10.5772/intechopen.80419
|View full text |Cite
|
Sign up to set email alerts
|

Some Commonly Used Speech Feature Extraction Algorithms

Abstract: Speech is a complex naturally acquired human motor ability. It is characterized in adults with the production of about 14 different sounds per second via the harmonized actions of roughly 100 muscles. Speaker recognition is the capability of a software or hardware to receive speech signal, identify the speaker present in the speech signal and recognize the speaker afterwards. Feature extraction is accomplished by changing the speech waveform to a form of parametric representation at a relatively minimized data… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 54 publications
(8 citation statements)
references
References 30 publications
0
8
0
Order By: Relevance
“…Mel-frequency cepstral coefficients (MFCCs) are widely used to extract features for voice-based authentication [ 13 , 14 , 15 , 16 , 17 , 18 ]. MFCCs are obtained by extracting features from the audio signal, and when used as input to the base model, they produce much better performance than when directly considering raw audio signals as input.…”
Section: Literature Reviewsmentioning
confidence: 99%
“…Mel-frequency cepstral coefficients (MFCCs) are widely used to extract features for voice-based authentication [ 13 , 14 , 15 , 16 , 17 , 18 ]. MFCCs are obtained by extracting features from the audio signal, and when used as input to the base model, they produce much better performance than when directly considering raw audio signals as input.…”
Section: Literature Reviewsmentioning
confidence: 99%
“…Autocorrelation coefficients are aliased in conventional linear prediction. The susceptibility of LPC estimates to quantization noise is high, so they are not well suited for generalization [19].…”
Section: Introductionmentioning
confidence: 99%
“…It can derive information from latent signals in both the time and frequency domains at the same time. Many wavelets are orthogonal, which is an outstanding feature for compact signal representation [19] [20]. The wavelet transform breaks down a signal into a set of simple functions known as wavelets.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…To extract the vocal features like human ear the algorithm should replicate the human acoustics. MFCC, LPC, Linear Prediction Cepstral Coefficients (LPCC), Linear Spectral Frequencies (LSF), Perceptual Linear prediction (PLP) imitate the human hearing and speaking tract and give relevant features [10]. MFCC filters frequencies linearly at low frequencies and logarithmically at high frequencies to preserve the phonetically vital properties of the speech signal.…”
Section: Introductionmentioning
confidence: 99%