2018
DOI: 10.1016/j.specom.2018.10.006
|View full text |Cite
|
Sign up to set email alerts
|

Semi-supervised acoustic model training for speech with code-switching

Abstract: In the FAME! project, we aim to develop an automatic speech recognition (ASR) system for Frisian-Dutch code-switching (CS) speech extracted from the archives of a local broadcaster with the ultimate goal of building a spoken document retrieval system. Unlike Dutch, Frisian is a low-resourced language with a very limited amount of manually annotated speech data. In this paper, we describe several automatic annotation approaches to enable using of a large amount of raw bilingual broadcast data for acoustic model… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
25
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 21 publications
(25 citation statements)
references
References 64 publications
0
25
0
Order By: Relevance
“…We refer to this automatically transcribed data as the 'Frisian Broadcast' data. The automatic transcription procedure is detailed in [17].…”
Section: Acoustic Datamentioning
confidence: 99%
See 1 more Smart Citation
“…We refer to this automatically transcribed data as the 'Frisian Broadcast' data. The automatic transcription procedure is detailed in [17].…”
Section: Acoustic Datamentioning
confidence: 99%
“…The acoustic data augmentation relies on available monolingual acoustic resources from the high-resourced mixed language (Dutch). Using more monolingual Dutch speech for acoustic model training has found to be effective in improving the general ASR performance, only after increasing the in-domain CS data applying the semi-supervised techniques described in [17,21].…”
Section: Introductionmentioning
confidence: 99%
“…This project developed an ASR system for Frisian–Dutch code-switching speech, as extracted from the archives of a local broadcaster. The goal of the system was to allow automatically retrieving relevant items from a large collection of news broadcasts, in response to user-specified text queries (Yilmaz et al, 2018 , p. 12). Similarly, Van Den Heuvel et al ( 2012 ) report applying ASR to disclose—via keyword retrieval−250 interviews with veterans of Dutch conflicts and military missions.…”
Section: Speech Technology To the Rescue?mentioning
confidence: 99%
“…Emre Yılmaz et al [22] developed a semi supervised acoustic model for speech recognition. This developed model assigns language label to speech signals.…”
Section: Types Of Asr Systemsmentioning
confidence: 99%