Proceedings of the Workshop on Speech and Natural Language - HLT '89 1989
DOI: 10.3115/100964.100990
|View full text |Cite
|
Sign up to set email alerts
|

SRI's DECIPHER system

Abstract: SRI has developed a speaker-independent continuous speech, large vocabulary speech recognition system, DECIPHER, that provides state-of-theart performance on the DARPA standard speakerindependent resource management training and testing materials. SRI's approach is to integrate speech and linguistic knowledge into the HMM framework. This paper describes performance improvements arising from detailed phonological modeling and from the incorporation of crossword coarticulatory constraints.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
3
1
1

Year Published

1989
1989
2019
2019

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(6 citation statements)
references
References 9 publications
1
3
1
1
Order By: Relevance
“…The results listed in Table I are approximately the same as those achieved by more conventional systems tested on the same data [13,14,15,16] and the perplexity 60 grammar. Given the difficulty of the task and the early stage of development of this system, however, we consider these results quite respectable.…”
Section: Interpretation Of the Resultssupporting
confidence: 60%
See 1 more Smart Citation
“…The results listed in Table I are approximately the same as those achieved by more conventional systems tested on the same data [13,14,15,16] and the perplexity 60 grammar. Given the difficulty of the task and the early stage of development of this system, however, we consider these results quite respectable.…”
Section: Interpretation Of the Resultssupporting
confidence: 60%
“…The word accuracy of our system is not as good as that obtained on exactly the same data by several other conventional systems [13,14,15,16]. However, we believe that a few correctable shortcomings of the existing system are responsible for the disparity.…”
Section: Introductioncontrasting
confidence: 68%
“…[4], because this design can satisfy the requirements of trainable and stability at the same time. A typical speech recognition front end [5] is shown in Figure 3. Here, the speech signal is filtered, windowed, and then transform coded, typically with mel-frequency weighted cepstrals.…”
Section: Speech Recognition Component In Pervasive Hmi Layermentioning
confidence: 99%
“…A number of sites have already moved in this direction by changing from contextindependent phone (monophone) models to left and right context dependent phone (triphone) models [18]. h further step along this line has been the inclusion of crossword triphone models [7,10,13] which has minimized the ability of the acoustic models to learn the bigram language model. 1 These changes have improved recognition performance when trained and tested on the same database, but their effects on vocabulary independence have not been tested.…”
Section: Els [5]mentioning
confidence: 99%