Abstract:In the present work we overview some recently proposed discrete Fourier transform (DFT)- and discrete wavelet packet transform (DWPT)-based speech parameterization methods and evaluate their performance on the speech recognition task. Specifically, in order to assess the practical value of these less studied speech parameterization methods, we evaluate them in a common experimental setup and compare their performance against traditional techniques, such as the Mel-frequency cepstral coefficients (MFCC) and per… Show more
“…Literature states that MFCC is most popular method because of its speech recognition performance [2]. The recognition performance mainly depends upon the choice of features and its extraction from the speech signal such that it gives a high recognition rate with less amount of computation [3]. This paper is organized as follows: Section 2 gives a brief idea about the feature extraction techniques.…”
This paper presents robust feature extraction techniques for isolated word recognition under noisy conditions. The proposed hybrid feature extraction techniques are Bark Frequency Cepstral Coefficients (BFCC) and Weighted Average Mel-Frequency Cepstral Coefficient (WMFCC). Both methods are tested in various noisy environments using a single Gaussian Hidden Markov Model (HMM) based isolated digit recognition system. The results clearly indicates that WMFCC performed well compared to Mel-Frequency Cepstral Coefficient (MFCC) in noisy environment using NOISEX-92 database.
“…Literature states that MFCC is most popular method because of its speech recognition performance [2]. The recognition performance mainly depends upon the choice of features and its extraction from the speech signal such that it gives a high recognition rate with less amount of computation [3]. This paper is organized as follows: Section 2 gives a brief idea about the feature extraction techniques.…”
This paper presents robust feature extraction techniques for isolated word recognition under noisy conditions. The proposed hybrid feature extraction techniques are Bark Frequency Cepstral Coefficients (BFCC) and Weighted Average Mel-Frequency Cepstral Coefficient (WMFCC). Both methods are tested in various noisy environments using a single Gaussian Hidden Markov Model (HMM) based isolated digit recognition system. The results clearly indicates that WMFCC performed well compared to Mel-Frequency Cepstral Coefficient (MFCC) in noisy environment using NOISEX-92 database.
“…En [1] se muestra una comparativa de distintos métodos de parametrización de voz, teniendo como base de comparación los Coeficientes Cepstrales en la Frecuencia de Mel, siendo estos los más usados debido a su bajo costo computacional y a su robustez. Con base en los resultados, se decidió analizar estas técnicas dentro del reconocimiento del habla robusto a cambios de tono en la voz, utilizando técnicas con distintas transformaciones, como la Transformada Discreta de Fourier (DFT) y la Transformada Discreta de Paquetes Wavelet (DWPT).…”
Palabras clave: Reconocimiento del hablante con cambios en la voz, coeficientes cepstrales en la frecuencia de Mel, parámetros cepstrales basados en sub-banda, espectrograma, máquina de soporte vectorial.
“…This helps in providing a closer response of human auditory system than the linearly spaced frequency bands which are derived from FFT (Fast Fourier Transform) and DCT [7] (Discrete Cosine Transform). Thus a mel frequency cepstrum results in more accurate processing of data.…”
Speech is most natural way of interaction for human. It has broad applications in the human-machine and humancomputer interaction. This paper reviews the literature and the technological aspects of human-machine interaction through various speech recognition approaches. It also discusses the various techniques used in each step of a speech recognition process and attempts to analyze an approach for designing an efficient system for speech recognition. It also discusses that how this system works and its application in various areas.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.