Guoli Ye scite author profile

Guoli Ye

3Publications

2Citation Statements Received

16Citation Statements Given

How they've been cited

How they cite others

Affiliations

Hong Kong University of Science and Technology, University of Hong Kong

Publications

Order By: Most citations

Transition probabilities are more important than we once thought

Chen

Mak

2012

View full text Add to dashboard Cite

It is generally believed that the transition probabilities in a hidden Markov model (HMM) have a limited role in the speech decoding process. In this paper, through a series of recognition experiments on Wall Street Journal (WSJ) read speech and SVitchboard (SVB) conversational telephone speech, we find that the HMM transition probabilities may be more important than we once thought. The experiments include: (1) setting or not setting all outgoing transition probabilities equal; (2) the introduction of word-final triphones and the re-estimation of their transition probabilities; (3) besides grammar factor and insertion penalty, the addition of a third decoding parameter called transition factor to scale the transition probability score during decoding. The results of the above three experiments enable us to improve the the word accuracy of the WSJ and SVB speech recognition task by 0.7% and 5.3% absolute respectively when compared to their baseline model in which all transition probabilities are simply set to 0.5.

show abstract

Subvector-quantized high-density discrete hidden Markov model and its re-estimation

Mak

2010

View full text Add to dashboard Cite

Speaker-ensemble hidden Markov modeling for automatic speech recognition

Mak

2012

View full text Add to dashboard Cite

This paper proposes a new hidden Makov model (HMM) which we call speaker-ensemble HMM (SE-HMM). An SE-HMM is a multi-path HMM in which each path is an HMM constructed from the training data of a different speaker. SE-HMM may be considered a form of template-based acoustic model where speaker-speci¿c acoustic templates are compressed statistically into speaker-speci¿c HMMs. However, one has the Àexibility of building SE-HMM at various level of compression: SE-HMM may be built for a triphone state, a triphone, a whole utterance, or other convenient phonetic units. As a result, SE-HMM contains more details than conventional HMM, but is much smaller than common templatebased acoustic models. Furthermore, the construction of SE-HMM is simple, and since it is still an HMM, its construction and computation is well supported by common HMM toolkits such as HTK. The proposed SE-HMM was evaluated on Resource Management and Wall Street Journal tasks, and it consistently gives better word recognition results than conventional HMM.Index Terms-detailed acoustic modeling, templatebased automatic speech recognition, speaker-ensemble acoustic model

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Guoli Ye

Transition probabilities are more important than we once thought

Subvector-quantized high-density discrete hidden Markov model and its re-estimation

Speaker-ensemble hidden Markov modeling for automatic speech recognition

Contact Info

Product

Resources

About