Liang-Che Sun scite author profile

This paper presents a bilingual acoustic modeling approach for transcribing Mandarin-English code-mixed lectures with a highly unbalanced language distribution. Special terminology for the content was produced in the guest language, English (about 15%), and embedded in utterances produced in the host language, Mandarin (about 85%). The code-mixing nature of the target corpus and the very small percentage of English data made the task difficult. State mapping and merging approaches, plus three stages of model adaptation, handle this problem. Significant improvements in recognition accuracy were obtained in experiments with a real bilingual code-mixed lecture corpus recorded at National Taiwan University. The code-mixing situation considered is very natural in the daily spoken language of many people in today's globalized world.


Improved tonal language speech recognition by integrating spectro-temporal evidence and pitch information with properly chosen tonal acoustic units

Wang, Sun, et al. (2011)


Multi-stream spectro-temporal and cepstral features based on data-driven hierarchical phoneme clusters

Sun, Lee (2011)


We propose a method to enhance multi-stream Gabor and MFCC features using data-driven hierarchical phoneme clusters to yield more discriminating posteriors. We take into account different hierarchy structures, and in addition perform mean and variance normalization. A relative improvement of 11.5% over the conventional MFCC Tandem system was achieved in experiments conducted on Mandarin broadcast news. We analyze the complementarity between Gabor and MFCC features for different types of phonemes, and investigate the benefits that come from using hierarchical phoneme clusters.


Improved phoneme recognition by integrating evidence from spectro-temporal and cepstral features

Sun, Lee (2010)



Modulation Spectrum Equalization for Improved Robust Speech Recognition

Bilingual acoustic modeling with state mapping and three-stage adaptation for transcribing unbalanced code-mixed lectures
