IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01.
DOI: 10.1109/asru.2001.1034590
|View full text |Cite
|
Sign up to set email alerts
|

Speech recognition using advanced HMM2 features

Abstract: HMMZ is a particular hidden Markov model where state emission probabilities of the temporal (primary) HMM are modeled through (secondary) state-dependent frequency-based HMMs [IZ]. As shown in [l3], a secondary HMM can also be used to extract robust ASK features. Here, we further investigate this novel approach towards using a full HMMZ as feature extractor, working in the spectral domain, and extracting robust formantlike features for standard ASK system. HMMZ performs a nonlinear, state-dependent frequency w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(8 citation statements)
references
References 9 publications
0
8
0
Order By: Relevance
“…This gives us a two-level HMM: a HMM where each state corresponds to a word, and where the output function is a HMM where each state corresponds to a letter. This relates to two other approaches that we are aware of (Fine et al, 1998) and (Weber et al, 2001).…”
Section: Perplexity Evaluationmentioning
confidence: 79%
“…This gives us a two-level HMM: a HMM where each state corresponds to a word, and where the output function is a HMM where each state corresponds to a letter. This relates to two other approaches that we are aware of (Fine et al, 1998) and (Weber et al, 2001).…”
Section: Perplexity Evaluationmentioning
confidence: 79%
“…Previously, promising results were obtained with both variants of HMM2. In [6], we reported word error rates (WER) of 14.0% (on the clean Numbers95 database, [1]) for variant (a). As described above, here the secondary HMM acted as likelihood estimator.…”
Section: Building From Previous Resultsmentioning
confidence: 99%
“…1, a secondary feature vector as used for the HMM2 system is thus composed of an FF2 coefficient (c s ), its first and second order derivatives (d s and a s ) and a further coefficient reflecting the frequency position of that vector (f s ). Supplementing the 3-dimensional secondary feature vector by such a 'frequency index' has shown significant benefits for speech recognition performance, allowing a better modeling of formant positions (the reader is referred to [6] for more details on the frequency index, its motivations, realization and performance improvements).…”
Section: Features For Hmm2mentioning
confidence: 99%
“…It has been shown that this frequency information improves discrimination between the different phonemes (Weber et al, 2001c). However, the impact of the frequency coefficient is different depending on whether it is treated (1) as an additional feature component (feature combination) or (2) as a second feature stream (likelihood combination).…”
Section: A Hmm2 Design Optionsmentioning
confidence: 99%
“…The subvectors are typically low-dimensional feature vectors, consisting of, for example, a coefficient, its first and second order time derivatives and an additional frequency index (Weber et al, 2001c). If such a temporal feature vector is to be emitted by a specific temporal HMM state, the associated sequence of frequency sub-vectors is emitted by the secondary HMM associated with the corresponding temporal HMM state.…”
Section: The Hmm2 Feature Extractormentioning
confidence: 99%