2023
DOI: 10.1109/tcsvt.2022.3210002
|View full text |Cite
|
Sign up to set email alerts
|

Audio-Driven Dubbing for User Generated Contents via Style-Aware Semi-Parametric Synthesis

Abstract: Existing automated dubbing methods are usually designed for Professionally Generated Content (PGC) production, which requires massive training data and training time to learn a person-specific audio-video mapping. In this paper, we investigate an audio-driven dubbing method that is more feasible for User Generated Content (UGC) production. There are two unique challenges to design a method for UGC: 1) the appearances of speakers are diverse and arbitrary as the method needs to generalize across users; 2) the a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
references
References 62 publications
(98 reference statements)
0
0
0
Order By: Relevance