2001
DOI: 10.1016/s0167-6393(00)00048-0
|View full text |Cite
|
Sign up to set email alerts
|

Time and frequency filtering of filter-bank energies for robust HMM speech recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
98
0

Year Published

2003
2003
2019
2019

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 122 publications
(122 citation statements)
references
References 36 publications
0
98
0
Order By: Relevance
“…Such processing is already under study in auditory neurophysiology (Kowalski et al, 1996a,b;Depireux et al, 2001;Miller et al, 2001Miller et al, , 2002Escabí and Schreiner, 2002) and psychoacoustics (Chi et al, 1999), and is also being investigated for various signal-processing tasks, including audio coding (Atlas and Shamma, 2003;Klein et al, 2003) and speech recognition (Hermmansky, 1999;Nadeu et al, 2001;Kleinschmidt and Gelbart, 2002;Kleinschmidt, 2002).…”
Section: The Linear Processing Of Spectrotemporal Modulation Frequenciesmentioning
confidence: 99%
“…Such processing is already under study in auditory neurophysiology (Kowalski et al, 1996a,b;Depireux et al, 2001;Miller et al, 2001Miller et al, , 2002Escabí and Schreiner, 2002) and psychoacoustics (Chi et al, 1999), and is also being investigated for various signal-processing tasks, including audio coding (Atlas and Shamma, 2003;Klein et al, 2003) and speech recognition (Hermmansky, 1999;Nadeu et al, 2001;Kleinschmidt and Gelbart, 2002;Kleinschmidt, 2002).…”
Section: The Linear Processing Of Spectrotemporal Modulation Frequenciesmentioning
confidence: 99%
“…It is to be noted that FF-features have previously been shown to yield similar recognition performance as mel-frequency cepstral coefficients (Nadeu et al, 2001). The FFfeatures were obtained with the following parameter set-up: frames of 32 ms length with a 10 ms shift between the frames were used; both preemphasis and Hamming window were applied to each frame; the short-time magnitude spectra, obtained by applying the FFT, was passed to Mel-spaced filter-bank analysis with 20 channels; the obtained logarithm filter-bank energies were then filtered using the filter H(z)=z-z −1 (Nadeu et al, 2001). A feature vector consisting of 18 elements was obtained (the edge values were excluded).…”
Section: Acoustic Modellingmentioning
confidence: 93%
“…The frequency-filtered logarithm filter-bank energies (Nadeu et al, 2001) (referred here as FF-features) were used as speech feature representation due to their suitability for missing-feature based recognition. It is to be noted that FF-features have previously been shown to yield similar recognition performance as mel-frequency cepstral coefficients (Nadeu et al, 2001).…”
Section: Acoustic Modellingmentioning
confidence: 99%
See 1 more Smart Citation
“…In this paper, we focus on ASR systems that use Frequency Filtered (FF) parameters (Nadeu et al (1995(Nadeu et al ( , 2001); Paliwal (1999)). This parameterization performs as well as the parameterizations in the cepstral domain such as the Mel-frequency cepstral coefficients (MFCC) and has the additional advantage of staying in the log-frequency domain.…”
Section: Introductionmentioning
confidence: 99%