2021
DOI: 10.48550/arxiv.2107.00308
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

An Objective Evaluation Framework for Pathological Speech Synthesis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 21 publications
0
1
0
Order By: Relevance
“…There are two previous works that focus on VC for clinical usage. The diagram on the left of Figure 1a depicts an N2D VC system presented in [5], which was a combination of a CycleGAN-based frame-wise VC model and a PSOLA-based speech rate modification process. This method suffers from the same issues as those in Section 2.1, including audible vocoder artifacts brought by the extra PSOLA operation, and the inability to preserve the speaker identity of the control speaker.…”
Section: Normal-to-dysarthric Vc For Clinical Usagementioning
confidence: 99%
“…There are two previous works that focus on VC for clinical usage. The diagram on the left of Figure 1a depicts an N2D VC system presented in [5], which was a combination of a CycleGAN-based frame-wise VC model and a PSOLA-based speech rate modification process. This method suffers from the same issues as those in Section 2.1, including audible vocoder artifacts brought by the extra PSOLA operation, and the inability to preserve the speaker identity of the control speaker.…”
Section: Normal-to-dysarthric Vc For Clinical Usagementioning
confidence: 99%