Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
DOI: 10.1109/icassp.2005.1415165
|View full text |Cite
|
Sign up to set email alerts
|

MFCC Compensation for Improved Recognition of Filtered and Band-Limited Speech

Abstract: This paper addresses the problem of bandwidth expansion for the purpose of robust speech recognition. We show that an HMM-based ASR engine trained with full spectrum range data (0-8kHz) can successfully perform speech recognition tasks over band-filtered test data compensated by means of a series of simple MFCC parameter corrector functions. The problem is important when ASR is employed for audio streams of unknown frequency bandwidth common in spoken document retrieval. Evaluation is based on recognition rate… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
6
0

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 9 publications
(7 citation statements)
references
References 8 publications
1
6
0
Order By: Relevance
“…This technique is used to mainly analyze and extract the characteristic of the excitation state. [8].…”
Section: Sub Segmental Analysismentioning
confidence: 99%
See 1 more Smart Citation
“…This technique is used to mainly analyze and extract the characteristic of the excitation state. [8].…”
Section: Sub Segmental Analysismentioning
confidence: 99%
“…The main disadvantage of statistical models is that they must take priori modeling assumptions which are answerable to be inaccurate, handicapping the system performance. In recent years, a new approach to the challenging problem of conversational speech recognition has emerged, holding a promise to overcome some fundamental limitations of the conventional Hidden Markov Model (HMM) approach (Bridle et al, 1998 [8]; Ma and Deng, 2004 [9]). This new approach is a radical departure from the current HMM-based statistical modeling approaches.…”
Section: Vistatistical Based Approachesmentioning
confidence: 99%
“…The algorithm presented in this paper is an evolution of our previous work on polynomial corrector functions as a means of compensation for MFCC parameters [8]. Stereo data (fulland restricted-bandwidth versions of train files) are used to obtain polynomial functions that map the restricted-bandwidth realizations to those in the full-bandwidth space; we call them Polynomial Corrector Functions (PCFs).…”
Section: Previous Workmentioning
confidence: 99%
“…The expansion of the spectral envelope is a more challenging task and strictly depends on the features that estimate the spectral envelope. In the BWE procedure, spectral envelope information is usually represented as a set of cepstral coefficients [2], linear predictive coding (LPC) coefficients [4], line spectral frequency coefficients [5][6][7], mel-frequency cepstral coefficients (MFCCs) [8][9][10], a set of autocorrelation coefficients [11], or melspectrum coefficients [12]. The codebook technique is the fundamental envelope prediction method.…”
Section: Introductionmentioning
confidence: 99%