2020
DOI: 10.48550/arxiv.2011.02128
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis

Abstract: Even though over seven hundred ethnic languages are spoken in Indonesia, the available technology remains limited that could support communication within indigenous communities as well as with people outside the villages. As a result, indigenous communities still face isolation due to cultural barriers; languages continue to disappear. To accelerate communication, speech-to-speech translation (S2ST) technology is one approach that can overcome language barriers. However, S2ST systems require machine translatio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 12 publications
(15 reference statements)
0
2
0
Order By: Relevance
“…Therefore, it can be said that when ASR is evaluated from WER, its error rate is higher than one thst evaluated with CER. In previous research (Novitasari et al, 2020), the evaluation used to evaluate the ASR model is CER. This method is used because the language contains some characters outside the standard alphabet.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Therefore, it can be said that when ASR is evaluated from WER, its error rate is higher than one thst evaluated with CER. In previous research (Novitasari et al, 2020), the evaluation used to evaluate the ASR model is CER. This method is used because the language contains some characters outside the standard alphabet.…”
Section: Discussionmentioning
confidence: 99%
“…Research by (Rouditchenko et al, 2023) stated a comparison of performance between the XSL-R and Whisper model in zero-shot conditions ( without fine-tuning) where the evaluation of model performance is lower in less seen or unseen language, which can categorized as low-resource language. There is research (Novitasari et al, 2020) to build an ASR model for ethnic languages in Indonesia. One of the best results is evaluating ASR model performance in recognizing speech in the Javanese language, which is 20.20% in CER evaluation.…”
Section: Introductionmentioning
confidence: 99%