Interspeech 2021 2021
DOI: 10.21437/interspeech.2021-971
|View full text |Cite
|
Sign up to set email alerts
|

GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 17 publications
(11 citation statements)
references
References 0 publications
0
11
0
Order By: Relevance
“…(9) and λ f m is a dynamically scaled scalar computed as λ f m = L recon /L f m following (Yang et al, 2021). Detailed training procedure as well as inference procedure is presented in Appendix B.…”
Section: Training Lossmentioning
confidence: 99%
See 3 more Smart Citations
“…(9) and λ f m is a dynamically scaled scalar computed as λ f m = L recon /L f m following (Yang et al, 2021). Detailed training procedure as well as inference procedure is presented in Appendix B.…”
Section: Training Lossmentioning
confidence: 99%
“…The first counterpart is the representative non-AR TTS model FastSpeech 2 (Ren et al, 2021a). The second model is the GANSpeech model introduced in (Yang et al, 2021). The third model is the DiffSpeech model presented in (Liu et al, 2021a).…”
Section: Experimental Setup For Comparisonmentioning
confidence: 99%
See 2 more Smart Citations
“…Adversarial loss 𝐿𝑎 is used to fool the discriminator by making 𝐶 𝑓 and 𝐹 𝑓 close to 1. Feature matching loss 𝐿 𝑓 is an effective loss function to improve stablity and quality of adversrial training [16,23]…”
Section: Training Algorithmmentioning
confidence: 99%