KES'2000. Fourth International Conference on Knowledge-Based Intelligent Engineering Systems and Allied Technologies. Proceedin
DOI: 10.1109/kes.2000.885793
|View full text |Cite
|
Sign up to set email alerts
|

Modular neural networks exploit multiple front-ends to improve speech recognition systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 9 publications
0
2
0
Order By: Relevance
“…One major advantage of the TIMIT task is the availability of the time aligned phonetic transcription which facilitates the AF MLPs training. Furthermore, the TIMIT corpus has been used by many other researchers in the speech processing community [2,4,10] and the performance of our system is comparable to other researchers [5]. While context-dependent phoneme models will give a better performance, our goal here is to demonstrate the usefulness of the fusion and context-independent phoneme model can simplify our experiments significantly.…”
Section: Experiments and Resultsmentioning
confidence: 79%
See 1 more Smart Citation
“…One major advantage of the TIMIT task is the availability of the time aligned phonetic transcription which facilitates the AF MLPs training. Furthermore, the TIMIT corpus has been used by many other researchers in the speech processing community [2,4,10] and the performance of our system is comparable to other researchers [5]. While context-dependent phoneme models will give a better performance, our goal here is to demonstrate the usefulness of the fusion and context-independent phoneme model can simplify our experiments significantly.…”
Section: Experiments and Resultsmentioning
confidence: 79%
“…While a single ASR system can perform quite well, many recent works are focusing on the integration of multiple systems [4,7,8,14,16,17] that result in a better performance than using a single system alone. Almost all of these focused on the integration during different stages of the recognition process, such as at frame level [7], state level [4,16,17] or word level [8]. For some system fusions, combination at the state level has been shown to be more effective [14].…”
Section: Introductionmentioning
confidence: 99%