Abstractvarious experiments have conclusively shown that superior continuous speech recognition performance is obtained when using context-dependent phonemic models. However, we have observed that using an explicit context-dependent phonemic model can yield many transcriptions for a single lexicon entry. In this work, we study the compression of the word transcription dictionaries (WTD) into a more compact form to balance the need between flexibility and reliability. Based on a measure of a likelihood function, a statistical model for an automatic procedure to compress a WTD is developed. The compressed dictionary is then used for sentence recognition in a continuous speech recognition system. Experimental results indicate a substantial improvement of the recognition rate after compression.
IntroductionSeveral continuous speech recognition systems currently being developed use context-dependent models which seek to capture the pronunciation variations resulting from phonetic context effects. It has been observed that with these models, large vocabulary speech recognition systems usually necessitates to compress, from some standard references, a word phonetic transcription dictionary (WTD) [i, 2, 3, 4]. Such a dictionary generally gives a single transcription for a lexicon entry. Continuous speech recognition systems have shown satisfactory results when using dictionaries prepared in this way [3,5,6,7]. However, because of the large variations in the pronouneiation of a given word [8, 9, 10, 11, 12, 13, ], it is very difficult, when compressing a dictionary, to capture its most representative variant. Word transcriptions are first obtained from the context-dependent phonemic model CODEPHON-STM based on tile automatically expending speed and context (AESC) approach developed in our laboratory [11,4], and then compressed into a more compact form to balance the need between flexibility and reliability using an automatic procedure. In fact, the direct word transcriptions given by CODEPHON-STM can yield many transcriptions for a single lexicon entry.