2022
DOI: 10.48550/arxiv.2204.10561
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Speaking-Rate-Controllable HiFi-GAN Using Feature Interpolation

Abstract: This paper presents a speaking-rate-controllable HiFi-GAN neural vocoder. Original HiFi-GAN is a high-fidelity, computationally efficient, and tiny-footprint neural vocoder. We attempt to incorporate a speaking rate control function into HiFi-GAN for improving the accessibility of synthetic speech. The proposed method inserts a differentiable interpolation layer into the HiFi-GAN architecture. A signal resampling method and an image scaling method are implemented in the proposed method to warp the mel-spectrog… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 30 publications
(52 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?