IEEE International Conference on Acoustics Speech and Signal Processing 1993
DOI: 10.1109/icassp.1993.319303
|View full text |Cite
|
Sign up to set email alerts
|

A comparison of trajectory and mixture modeling in segment-based word recognition

Abstract: a discussion of our results and possible future work. This paper presents a mechanism for implementing 2. MICROSEGMENT FRAMEWORK mixtures at a phone-subsegment (microsegment) level The framework consists of two levels: the upper level

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

1993
1993
2008
2008

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(10 citation statements)
references
References 3 publications
0
10
0
Order By: Relevance
“…It is worth pointing out another work (Kannan & Ostendorf, 1993) which also models trajectories and uses mixture distributions. The difference is that we define mixtures on the trajectory level, rather than on the state (micro-segment) level (Kannan & Ostendorf, 1993), and in fact we are using a form of context dependence. The context dependence here is data driven, and obtained via clustering (as we use a mixture of models per phoneme), as opposed to being linguistically motivated.…”
Section: Resultsmentioning
confidence: 99%
“…It is worth pointing out another work (Kannan & Ostendorf, 1993) which also models trajectories and uses mixture distributions. The difference is that we define mixtures on the trajectory level, rather than on the state (micro-segment) level (Kannan & Ostendorf, 1993), and in fact we are using a form of context dependence. The context dependence here is data driven, and obtained via clustering (as we use a mixture of models per phoneme), as opposed to being linguistically motivated.…”
Section: Resultsmentioning
confidence: 99%
“…The generalization from the the single-trend model can be viewed as providing discrete-mode distributions on the segment-bound polynomial parameters. 10 Development of this new model is motivated mainly by the observation that contextual and speaker variations bring about widely varying trajectory shapes of the acoustic data in fluent, speakerindependent speech examined in the TIMIT data base. The speech recognition evaluation results we have obtained so far show consistent performance improvement in the recognizer based on the new model.…”
Section: Summary and Discussionmentioning
confidence: 99%
“…Continuous explicit variable duration HMM is adopted in the speech recognition. Compared with standard HMM, results show that the absence of a correct duration model increases the error rate by 50% [4][5][6] . Due to the inherent ambiguity related to the segmentation process in handwritten words, it is a practical idea to use the variable duration model for the states in an HMM based handwritten word recognition (HWR) system [7,8] .…”
Section: Introductionmentioning
confidence: 92%