ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022
DOI: 10.1109/icassp43922.2022.9746278
|View full text |Cite
|
Sign up to set email alerts
|

DGC-Vector: A New Speaker Embedding for Zero-Shot Voice Conversion

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 17 publications
0
2
0
Order By: Relevance
“…However, GAN-based models are usually hard to train. Disentanglement-based approaches such as [12,13,14,15,16] aim to split the speech into spoken content and speaker characteristic (i.e. timbre).…”
Section: Introductionmentioning
confidence: 99%
“…However, GAN-based models are usually hard to train. Disentanglement-based approaches such as [12,13,14,15,16] aim to split the speech into spoken content and speaker characteristic (i.e. timbre).…”
Section: Introductionmentioning
confidence: 99%
“…Speaker embedding methods, such as DGC-VECTOR [7], AutoVC [8], SEVC [9], YourTTS [10] IZSVC [11] and VoiceLoop [12], use a generation process conditioned on speaker embedding. During training, these embeddings are calculated for the training set.…”
Section: Introductionmentioning
confidence: 99%