2023
DOI: 10.1016/j.csl.2022.101427
|View full text |Cite
|
Sign up to set email alerts
|

Cross-lingual multi-speaker speech synthesis with limited bilingual training data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 26 publications
0
2
0
Order By: Relevance
“…However, this approach is based on a multi-speaker TTS system as opposed to our single-speaker model. Similarly, data augmentation using a voice conversion module was explored in (Cai et al, 2023) and (Ribeiro et al, 2022).…”
Section: Related Workmentioning
confidence: 99%
“…However, this approach is based on a multi-speaker TTS system as opposed to our single-speaker model. Similarly, data augmentation using a voice conversion module was explored in (Cai et al, 2023) and (Ribeiro et al, 2022).…”
Section: Related Workmentioning
confidence: 99%
“…C ROSS-lingual text-to-speech (TTS) [1], [2], [3] refers to the task that requires the system to generate speech in a language foreign to a target speaker. This task has many applications, such as code-mixed speech synthesis for a voice agent, foreign movie dubbing [4], and computer-assisted pronunciation teaching [5].…”
Section: Introductionmentioning
confidence: 99%
“…Voice conversion methods using data augmentation generate parallel data with acoustic features, such as duration, prosody, and energy, similar to the original voice, and then perform parallel voice conversion. Voice conversion methods that utilize nonparallel data based on deep learning have also been studied [18][19][20][21][22]. Other voice conversion methods include text-based approaches.…”
Section: Introductionmentioning
confidence: 99%