2012 8th International Symposium on Chinese Spoken Language Processing
DOI: 10.1109/iscslp.2012.6423531

Minimum Phone Error model training on merged acoustic units for transcribing bilingual code-switched speech

Abstract: This paper proposes to perform Minimum Phone Error (MPE) model training on merged acoustic units for transcribing Mandarin-English code-switched lectures with a highly imbalanced language distribution. Some acoustic events in Mandarin and English may have very similar characteristics, so the states or Gaussian mixtures representing them can be merged to share identical parameters. When MPE is performed afterwards, these merged states or Gaussian mixtures form a compact acoustic unit set. …
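The merging step described in the abstract can be illustrated with a minimal sketch: pair Gaussians from the Mandarin and English acoustic models by a symmetric KL distance and tie the parameters of sufficiently close pairs. The diagonal-covariance assumption, the distance threshold, and the simple parameter averaging below are illustrative assumptions, not the paper's exact procedure.

```python
# Minimal sketch of cross-lingual Gaussian merging (hypothetical interface):
# diagonal-covariance Gaussians, a symmetric KL distance, and a fixed threshold
# are assumptions for illustration; the paper's merging criterion may differ.
import numpy as np

def sym_kl_diag_gauss(mu1, var1, mu2, var2):
    """Symmetric KL divergence between two diagonal-covariance Gaussians."""
    kl12 = 0.5 * np.sum(np.log(var2 / var1) + (var1 + (mu1 - mu2) ** 2) / var2 - 1.0)
    kl21 = 0.5 * np.sum(np.log(var1 / var2) + (var2 + (mu1 - mu2) ** 2) / var1 - 1.0)
    return kl12 + kl21

def merge_similar_gaussians(mandarin, english, threshold=1.0):
    """For each English Gaussian, find the closest Mandarin Gaussian and,
    if the distance is below the threshold, tie their parameters (here a
    simple average; a real system would weight by occupation counts).
    Each model is a list of (mean, variance) numpy-array pairs."""
    merged = []
    for j, (mu_e, var_e) in enumerate(english):
        dists = [sym_kl_diag_gauss(mu_m, var_m, mu_e, var_e)
                 for mu_m, var_m in mandarin]
        i = int(np.argmin(dists))
        if dists[i] < threshold:
            mu_m, var_m = mandarin[i]
            merged.append((i, j, 0.5 * (mu_m + mu_e), 0.5 * (var_m + var_e)))
    return merged
```

MPE training would then be run on the resulting compact acoustic unit set, with the tied states or Gaussians sharing parameters.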

Cited by 3 publications (2 citation statements). References 11 publications (10 reference statements).
“…Small corpora have been compiled for English-Spanish [1,2], Cantonese-English [3,4], Hindi-English [5] and for Sepedi-English [6]. However, the language pair English-Mandarin has received by far the most attention [7][8][9][10][11][12][13][14]. Approaches to code-switched language modelling include interpolating n-gram language models (LM) trained on monolingual data [13], n-grams trained on code-switched data [5,7], class-based n-grams using additional features [4], recurrent neural networks [10], and combinations of approaches [11].…”
Section: Introduction (mentioning, confidence: 99%)
“…However, the language pair English-Mandarin has received by far the most attention [7][8][9][10][11][12][13][14]. Approaches to code-switched language modelling include interpolating n-gram language models (LM) trained on monolingual data [13], n-grams trained on code-switched data [5,7], class-based n-grams using additional features [4], recurrent neural networks [10], and combinations of approaches [11]. A particularly relevant recent study considered features for factored language models for Mandarin-English code-switched speech using the SEAME corpus [12].…”
Section: Introduction (mentioning, confidence: 99%)
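One of the language-modelling approaches mentioned in the citing work, interpolating n-gram language models trained on monolingual data [13], can be sketched as a simple linear combination of per-language probabilities. The callable LM interface and the fixed interpolation weight below are hypothetical; in practice the weight would be tuned on held-out code-switched data.

```python
# Minimal sketch of linearly interpolating two monolingual n-gram LMs for
# code-switched text; lm_mandarin / lm_english are hypothetical callables
# returning backed-off n-gram probabilities, and lam is a tunable weight.
def interpolated_prob(word, history, lm_mandarin, lm_english, lam=0.5):
    """P(word | history) = lam * P_mandarin + (1 - lam) * P_english."""
    return lam * lm_mandarin(word, history) + (1.0 - lam) * lm_english(word, history)
```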