1993
DOI: 10.1016/0167-6393(93)90027-i
|View full text |Cite
|
Sign up to set email alerts
|

A robust speaker-independent isolated word HMM recognizer for operation over the telephone network

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
4
0

Year Published

1997
1997
2001
2001

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 6 publications
0
4
0
Order By: Relevance
“…. From the missing feature theory [31], [32], the probability we want to compute is (10) For the missing observation vector, , we know (11) By substituting (7) and (12) int (10), we finally obtain (12) The transition probabilities have less effect in the Viterbi search than the observation probabilities [33]. Therefore, we can set .…”
Section: B Deletion Of Erased Framesmentioning
confidence: 99%
“…. From the missing feature theory [31], [32], the probability we want to compute is (10) For the missing observation vector, , we know (11) By substituting (7) and (12) int (10), we finally obtain (12) The transition probabilities have less effect in the Viterbi search than the observation probabilities [33]. Therefore, we can set .…”
Section: B Deletion Of Erased Framesmentioning
confidence: 99%
“…) (Song & Samouelian, 1993). The signal was then processed by a 256-point (16 ms) Hamming window with a frame shift of 80 points (5 ms).…”
Section: Mel Frequency Cepstral Coefficientsmentioning
confidence: 99%
“…A conventional HMM recognition system is based on the one described in [3]. with 5 left-to-right states and 6 mixture compo nents per state being used to get a baseline performance.…”
Section: Recognition Phasementioning
confidence: 99%
“…The reason for this is because transitional probabilities of left-to-right HMMs play an insignifi cant role due to their dynamic numerical range compared with local likelihood, thereby they can be ignored altogether without any nota ble effect on recognition accuracy[3].This simple sum-up through the frame-to-frame and state-to-state likelihood propagation in the Viterbi algorithm possesses a very interesting characteristic, i.e. the matching process can be viewed as a sophisticated model dependent transformation, which transforms each acoustic vector XI to a scalar quantity logb , .�i (x t ) .…”
mentioning
confidence: 98%