2007
DOI: 10.3844/jcssp.2007.608.616
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of Speech Features on the Speech Recognition Task

Abstract: In the present work we overview some recently proposed discrete Fourier transform (DFT)- and discrete wavelet packet transform (DWPT)-based speech parameterization methods and evaluate their performance on the speech recognition task. Specifically, in order to assess the practical value of these less studied speech parameterization methods, we evaluate them in a common experimental setup and compare their performance against traditional techniques, such as the Mel-frequency cepstral coefficients (MFCC) and per… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
0
1

Year Published

2011
2011
2020
2020

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 37 publications
(21 citation statements)
references
References 21 publications
(32 reference statements)
0
19
0
1
Order By: Relevance
“…Literature states that MFCC is most popular method because of its speech recognition performance [2]. The recognition performance mainly depends upon the choice of features and its extraction from the speech signal such that it gives a high recognition rate with less amount of computation [3]. This paper is organized as follows: Section 2 gives a brief idea about the feature extraction techniques.…”
Section: Introductionmentioning
confidence: 99%
“…Literature states that MFCC is most popular method because of its speech recognition performance [2]. The recognition performance mainly depends upon the choice of features and its extraction from the speech signal such that it gives a high recognition rate with less amount of computation [3]. This paper is organized as follows: Section 2 gives a brief idea about the feature extraction techniques.…”
Section: Introductionmentioning
confidence: 99%
“…En [1] se muestra una comparativa de distintos métodos de parametrización de voz, teniendo como base de comparación los Coeficientes Cepstrales en la Frecuencia de Mel, siendo estos los más usados debido a su bajo costo computacional y a su robustez. Con base en los resultados, se decidió analizar estas técnicas dentro del reconocimiento del habla robusto a cambios de tono en la voz, utilizando técnicas con distintas transformaciones, como la Transformada Discreta de Fourier (DFT) y la Transformada Discreta de Paquetes Wavelet (DWPT).…”
Section: Introductionunclassified
“…This helps in providing a closer response of human auditory system than the linearly spaced frequency bands which are derived from FFT (Fast Fourier Transform) and DCT [7] (Discrete Cosine Transform). Thus a mel frequency cepstrum results in more accurate processing of data.…”
Section: Mel Cepstrum Analysismentioning
confidence: 99%