2012
DOI: 10.1109/tasl.2011.2177820
|View full text |Cite
|
Sign up to set email alerts
|

Voice Conversion Using Dynamic Frequency Warping With Amplitude Scaling, for Parallel or Nonparallel Corpora

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
74
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 101 publications
(74 citation statements)
references
References 22 publications
0
74
0
Order By: Relevance
“…In particular, the ML-GMM method is a well-established baseline method in the voice conversion research. In the frequency warping methods, the weighted frequency warping with amplitude scaling (WFW-AS) has been reported to achieve comparable performance to ML-GMM in terms of speaker similarity [39]. Hence, ML-GMM, DKPLS, and WFW-AS could be good choices to simulate voice conversion spoofing attacks when the training data are limited, although not all of them have been applied to spoofing attacks.…”
Section: Discussionmentioning
confidence: 99%
“…In particular, the ML-GMM method is a well-established baseline method in the voice conversion research. In the frequency warping methods, the weighted frequency warping with amplitude scaling (WFW-AS) has been reported to achieve comparable performance to ML-GMM in terms of speaker similarity [39]. Hence, ML-GMM, DKPLS, and WFW-AS could be good choices to simulate voice conversion spoofing attacks when the training data are limited, although not all of them have been applied to spoofing attacks.…”
Section: Discussionmentioning
confidence: 99%
“…As alternatives to data-driven statistical conversion methods, frequency warping based approaches to voice conversion were introduced in (Toda et al, 2001;Sundermann and Ney, 2003;Erro et al, 2010;Godoy et al, 2012;Erro et al, 2013). Rather than directly substituting the spectral characteristics of the input speech signal, these techniques effectively warp the frequency axis of a source spectrum to match that of the target.…”
Section: Voice Conversionmentioning
confidence: 99%
“…Frequency warping approaches tend to retain spectral details and produce high quality converted speech. A so-called Gaussian-dependent filtering approach to voice conversion introduced in (Matrouf et al, 2006;Bonastre et al, 2007) is related to amplitude scaling (Godoy et al, 2012) within a frequency warping framework.…”
Section: Voice Conversionmentioning
confidence: 99%
“…We conducted subjective quality evaluations in a format similar to multi-stimulus test with hidden reference and anchor (MUSHRA) [32]. The listeners were presented with four test signals: (a) a hidden reference-the target speaker, (b) enhanced JGMM, (c) CGMM, and (d) En-GB.…”
Section: Subjective Evaluationsmentioning
confidence: 99%