2020
DOI: 10.48550/arxiv.2005.10441
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Cross-lingual Multispeaker Text-to-Speech under Limited-Data Scenario

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 14 publications
0
1
0
Order By: Relevance
“…Synthesizing speech from multiple speakers with the use of learnable speaker embeddings has been thoroughly examined from the very start of neural TTS [4] up to most recent efforts [5]. Controlling language with learnable embeddings is also straightforward [6,7] and recently, the concept of metalearning has been shown effective for this purpose [8]. In order to avert the inherent problem of language-dependent speaker representations, domain adaptation has been utilized [9].…”
Section: Introductionmentioning
confidence: 99%
“…Synthesizing speech from multiple speakers with the use of learnable speaker embeddings has been thoroughly examined from the very start of neural TTS [4] up to most recent efforts [5]. Controlling language with learnable embeddings is also straightforward [6,7] and recently, the concept of metalearning has been shown effective for this purpose [8]. In order to avert the inherent problem of language-dependent speaker representations, domain adaptation has been utilized [9].…”
Section: Introductionmentioning
confidence: 99%