1994
DOI: 10.1109/89.260358
|View full text |Cite
|
Sign up to set email alerts
|

A hybrid segmental neural net/hidden Markov model system for continuous speech recognition

Abstract: Absfrucf-The current state-of-the-art in large-vocabulary, continuous speech recognition is based on the use of hidden Markov models (HMM). In an attempt to improve over HMM performance, we developed a hybrid system that combines the advantages of neural networks and HMM using a multiple hypothesis (or N-best) paradigm. The connectionist component of the system, the segmental neural net (SNN), models all the frames of a phonetic segment simultaneously, thus overcoming the well-known conditional-independence li… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0
1

Year Published

1996
1996
2011
2011

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 59 publications
(27 citation statements)
references
References 26 publications
0
26
0
1
Order By: Relevance
“…The idea, proposed in [130], is developed in [132], where a connectionist approach is applied to the problem of rescoring the hypothesis generated by an HMM which uses an N-best strategy. In this case the network does not compute scores on individual acoustic frames, but on whole segments (sub-sequencies) of frames, corresponding to phonemes.…”
Section: Other Approachesmentioning
confidence: 99%
See 2 more Smart Citations
“…The idea, proposed in [130], is developed in [132], where a connectionist approach is applied to the problem of rescoring the hypothesis generated by an HMM which uses an N-best strategy. In this case the network does not compute scores on individual acoustic frames, but on whole segments (sub-sequencies) of frames, corresponding to phonemes.…”
Section: Other Approachesmentioning
confidence: 99%
“…Di!erent neural architectures were tried (1-layer perceptron, MLP, HyperBF) [132] without signi"cant #uctuations in performance. They were also combined altogether to obtain a more robust rescoring process.…”
Section: Other Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…The most popular solution for avoiding the problems associated with P (S|A) is to run a frame-based (e.g. HMM) recognizer, and re-score only the N best paths by the segmental phoneme models [11]. We, however, wanted to model P (S|A) with discriminative classifiers, for which we trained segmental probabilities P (s i |A i ).…”
Section: Segment-based Recognitionmentioning
confidence: 99%
“…MLP-based posteriors have also been used to re-score hypothesis in continuous speech recognition [4].…”
Section: Introductionmentioning
confidence: 99%