Ghazi Bouselmi scite author profile

Ghazi Bouselmi

3Publications

52Citation Statements Received

7Citation Statements Given

How they've been cited

How they cite others

Affiliations

Lorraine Research Laboratory in Computer Science and its Applications, French Institute for Research in Computer Science and Automation, French National Centre for Scientific Research

Publications

Order By: Most citations

Fully Automated Non-Native Speech Recognition Using Confusion-Based Acoustic Model Integration and Graphemic Constraints

Bouselmi

Fohr

Illina

et al.

View full text Add to dashboard Cite

This paper presents a fully automated approach for the recognition of non-native speech based on acoustic model modification. For a native language (L1) and a spoken language (L2), pronunciation variants of the phones of L2 are automatically extracted from an existing non-native database as a confusion matrix with sequences of phones of L1. This is done using L1's and L2's ASR systems. This confusion concept deals with the problem of non existence of match between some L2 and L1 phones. The confusion matrix is then used to modify the acoustic models (HMMs) of L2 phones by integrating corresponding L1 phone models as alternative HMM paths. We introduce graphemic contraints in the confusion extraction process: the phonetic confusion is established for each couple of 'L2phone' and the grapheme(s) correspondig to that phone. We claim that prononciation errors may depend on the graphemes related to each phone. The modified ASR system achieved an improvement between 32% and 40% (relative, L1=French and L2=English) in WER on the French non-native database used for testing. The introduction of graphemic contraints in the phonetic confusion allowed further improvements.

show abstract

Multilingual non-native speech recognition using phonetic confusion-based acoustic model modification and graphemic constraints

Bouselmi¹,

Fohr²,

Illina³

et al. 2006

View full text Add to dashboard Cite

In this paper we present an automated approach for non-native speech recognition. We introduce a new phonetic confusion concept that associates sequences of native language (NL) phones to spoken language (SL) phones. Phonetic confusion rules are automatically extracted from a non-native speech database for a given NL and SL using both NL's and SL's ASR systems. These rules are used to modify the acoustic models (HMMs) of SL's ASR by adding acoustic models of NL's phones according to these rules. As pronunciation errors that non-native speakers produce depend on the writing of the words, we have also used graphemic constraints in the phonetic confusion extraction process. In the lexicon, the phones in words' pronunciations are linked to the corresponding graphemes (characters) of the word. In this way, the phonetic confusion is established between couples of (SL phones, graphemes) and sequences of NL phones. We evaluated our approach on French, Italian, Spanish and Greek non-native speech databases. The spoken language is English. The modified ASR system achieved significant improvements ranging from 20.3% to 43.2% (relative) in sentence error rate and from 26.6% to 50.0% in WER.

show abstract

Efficient likelihood evaluation and dynamic Gaussian selection for HMM-based speech recognition

Cai

Bouselmi

Laprie

et al. 2009

Computer Speech & Language

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ghazi Bouselmi

Fully Automated Non-Native Speech Recognition Using Confusion-Based Acoustic Model Integration and Graphemic Constraints

Multilingual non-native speech recognition using phonetic confusion-based acoustic model modification and graphemic constraints

Efficient likelihood evaluation and dynamic Gaussian selection for HMM-based speech recognition

Contact Info

Product

Resources

About