2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2012
DOI: 10.1109/icassp.2012.6289015
|View full text |Cite
|
Sign up to set email alerts
|

A first speech recognition system for Mandarin-English code-switch conversational speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
86
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
5
4

Relationship

2
7

Authors

Journals

citations
Cited by 120 publications
(93 citation statements)
references
References 6 publications
0
86
0
Order By: Relevance
“…In addition to the well-established research line in linguistics, implications of CS and other kinds of language switches for speechto-text systems have recently received some research interest, resulting in some robust acoustic modeling [1][2][3][4][5] and language modeling [6][7][8] approaches for CS speech. Language identification (LID) is a relevant task for the automatic speech recognition (ASR) of CS speech [9][10][11][12].…”
Section: Introductionmentioning
confidence: 99%
“…In addition to the well-established research line in linguistics, implications of CS and other kinds of language switches for speechto-text systems have recently received some research interest, resulting in some robust acoustic modeling [1][2][3][4][5] and language modeling [6][7][8] approaches for CS speech. Language identification (LID) is a relevant task for the automatic speech recognition (ASR) of CS speech [9][10][11][12].…”
Section: Introductionmentioning
confidence: 99%
“…That is, when aligning recognition results with the reference transcriptions, insertions, deletions, substitutions were evaluated respectively for each language and summed up for overall evaluation. The basic unit for alignment is character for Mandarin and word for English [3] [12], so the accuracies reported here are with respect to characters for Mandarin and to words for English.…”
Section: Experiments Setupmentioning
confidence: 99%
“…We used the KneserNey tri-gram model, started with a background model and then adapted with the transcription of the training set for the target lecture here. The way the recognition accuracy was evaluated followed the earlier works [3], [12]. That is, when aligning recognition results with the reference transcriptions, insertions, deletions, substitutions were evaluated respectively for each language and summed up for overall evaluation.…”
Section: Experiments Setupmentioning
confidence: 99%
“…They discover that clustering all foreign words into their POS classes leads to the best performance. In (Li et al, 2012;Li et al, 2013), the authors propose to integrate the equivalence constraint into language modeling for Mandarin and English CodeSwitching speech recorded in Hong Kong.…”
Section: Related Workmentioning
confidence: 99%