Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181
DOI: 10.1109/icassp.1998.675429
|View full text |Cite
|
Sign up to set email alerts
|

Automatic speech recognition based on cepstral coefficients and a mel-based discrete energy operator

Abstract: In this paper, a novel feature vector based on both Mel Frequency Cepstral Coefficients (MFCCs) and a Mel-based nonlinear Discrete-time Energy Operator (MDEO) is proposed to be used as the input of an HMM-based Automatic Continuous Speech Recognition (ACSR) system. Our goal is to improve the performance of such a recognizer using the new feature vector. Experiments show that the use of the new feature vector increases the recognition rate of the ACSR system. The HTK Hidden Markov Model Toolkit was used through… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 9 publications
0
5
0
Order By: Relevance
“…The energy operator has been applied successfully to demodulation and has many attractive features such as simplicity, efficiency, and adaptability to instantaneous signal variations [3]. The attractive physical interpretation of the energy operator has led to its use as an ASR feature extractor in various forms, see for example [12], [13].…”
Section: Quadratic Operators and Energy Spectrummentioning
confidence: 99%
See 2 more Smart Citations
“…The energy operator has been applied successfully to demodulation and has many attractive features such as simplicity, efficiency, and adaptability to instantaneous signal variations [3]. The attractive physical interpretation of the energy operator has led to its use as an ASR feature extractor in various forms, see for example [12], [13].…”
Section: Quadratic Operators and Energy Spectrummentioning
confidence: 99%
“…In general, using (5) the sum of any quadratic operator output (e.g., see [4], [1]) can be expressed as (12) where are arbitrary constants. For narrowband signals , can be assumed constant around and the short-time average of can be expressed as (13) i.e., the difference between the log of any time-frequency distribution produced by the generalized ASR front-end in Fig.…”
Section: Quadratic Operators and Energy Spectrummentioning
confidence: 99%
See 1 more Smart Citation
“…For resonance signals, the Teager-Kaiser Energy and the nonlinear energy operator Ψ provide a good estimation of the "real" source energy. Recently, Teager energy has been used for speech recognition in [10,12]. In this paper, we extend this work and design a front-end that combines an auditorymotivated filterbank with the Teager energy estimation method.…”
Section: Introductionmentioning
confidence: 94%
“…The Teager energy is a noise robust parameter for speech recognition because the effect of additive noise is attenuated: good results are obtained in presence of car engine noise [20]. The Instantaneous energy reflects only the amplitude of the signal whereas the Teager energy operator reflects the variations in both amplitude and frequency of the signal [45]. Figure 4 is an example of two spectrograms: one based on wavelet coefficients (Coiflet, 5 bands, Teager energy) and the other based on STFT coefficients for the same signal.…”
Section: Energy-based Parametersmentioning
confidence: 99%