2020
DOI: 10.35741/issn.0258-2724.55.4.33
|View full text |Cite
|
Sign up to set email alerts
|

Voice Conversion for Dubbing Using Linear Predictive Coding and Hidden Markov Model

Abstract: Dubbing is a term used to describe filling in the sound on film or video. Voice conversion can be done to support dubbing, for purposes such as obtaining a child’s voice for dubbing on children’s films. However, problems frequently occur with this process, including difficulty finding children’s voice resources and difficulty getting children to express the desired tone and mood while recording. Therefore, in this study, we propose a method for creating a cross-gender and age voice conversion from adult voices… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

0
1
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 11 publications
0
1
0
Order By: Relevance
“…Our work is different from all of these in several important aspects: (a) first, our work considered using real media data from professional voice talents, (b) next, in [18], dubbing was performed for Indonesian language using several words only, (c) in [17], VC was performed for data augmentation and did not investigate target speaker quality and similarity, (d) and finally, in [19] VC was applied to the output of a speech synthesizer.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Our work is different from all of these in several important aspects: (a) first, our work considered using real media data from professional voice talents, (b) next, in [18], dubbing was performed for Indonesian language using several words only, (c) in [17], VC was performed for data augmentation and did not investigate target speaker quality and similarity, (d) and finally, in [19] VC was applied to the output of a speech synthesizer.…”
Section: Introductionmentioning
confidence: 99%
“…One of the studies reported using read speech for training adult to child CycleGAN VC model for ASR application [17]. Other studies by [18], [19] reported using Gaussian mixture model (GMM) based adult to child VC for speaker adaptation and dubbing.…”
Section: Introductionmentioning
confidence: 99%
“…Once having a good disentangle strategy, the model can generate a high quality of speech from the given utterance and style. A successful VC can be applied to various fields, such as personal electrical support as an audio assistant (Lu et al 2021), entertainment usage for dubbing (Mukhneri, Wijayanto, and Hadiyoso 2020), and industrial applications for voice changers, etc.…”
Section: Introductionmentioning
confidence: 99%
“…Voice conversion (VC) aims at transforming the vocal timbre of the source speech to the target speaker while preserving its linguistic content. It has many applications, including movie dubbing [3], speaking assistance [4] and singing [5,6,7]. With the advances of deep learning, neural voice conversion methods have been studied extensively in recent years with highquality natural converted speech [8], such as generative adversarial network (GAN)-based [9,10], variational autoencoder (VAE)-based [11], autoencoder-based [12] and flow-based [13] models, to name a few.…”
Section: Introductionmentioning
confidence: 99%