6th Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU 2018) 2018
DOI: 10.21437/sltu.2018-27
|View full text |Cite
|
Sign up to set email alerts
|

Code-Switching Detection with Data-Augmented Acoustic and Language Models

Abstract: In this paper, we investigate the code-switching detection performance of a code-switching (CS) automatic speech recognition (ASR) system with data-augmented acoustic and language models. We focus on the recognition of Frisian-Dutch radio broadcasts where one of the mixed languages, namely Frisian, is under-resourced. Recently, we have explored how the acoustic modeling (AM) can benefit from monolingual speech data belonging to the high-resourced mixed language. For this purpose, we have trained state-of-the-a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 24 publications
0
7
0
Order By: Relevance
“…Firstly, in terms of data augmentation, we boost the size of CS corpus by synthetically generating CS text using a well-trained long short-term memory (LSTM) language model. Similar techniques are also proposed in [27,28]. However, in [27] a sentence level aligned parallel corpus is available, thus synthetic CS data can be generated based on word or phrase alignment between the parallel sentences and guided by linguistic rules.…”
Section: Cs Rnn Language Modelingmentioning
confidence: 99%
“…Firstly, in terms of data augmentation, we boost the size of CS corpus by synthetically generating CS text using a well-trained long short-term memory (LSTM) language model. Similar techniques are also proposed in [27,28]. However, in [27] a sentence level aligned parallel corpus is available, thus synthetic CS data can be generated based on word or phrase alignment between the parallel sentences and guided by linguistic rules.…”
Section: Cs Rnn Language Modelingmentioning
confidence: 99%
“…Moreover, we have created CS text, which is almost nonexistent, in multiple ways providing perplexity reductions on the development and test set transcriptions. The data-augmented models have been shown to provide better CS detection in terms of equal error rate, but have the tendency to hypothesize much more language switches compared to the human annotations [21].…”
Section: Baseline Approach: Time Alignment Of Cs Asr Outputmentioning
confidence: 99%
“…In our latest work, we have investigated this code-switching detection performance using data-augmented CS ASR systems and observed that this technique suffers from over-switching due to numerous very short spurious language switches [21]. To cope with shortcoming, this paper introduces a new method for code-switching detection which uses frame-level language posteriors which are created using a CS ASR system.…”
Section: Introductionmentioning
confidence: 99%
“…To alleviate the problem of code-switching data shortage, monolingual data and data augment technology are usually used [10,11,12,13,14]. For the monolingual, some E2E models have achieved great performance with large-scale data.…”
Section: Introductionmentioning
confidence: 99%