2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP) 2017
DOI: 10.1109/atsip.2017.8075528
|View full text |Cite
|
Sign up to set email alerts
|

A comparative study of voice conversion techniques: A review

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
5
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
1
1
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(5 citation statements)
references
References 33 publications
0
5
0
Order By: Relevance
“…Voice conversion (VC) [1] aims to modify a speech signal uttered by a source speaker to sound as if it is uttered by a target speaker while retaining the linguistic information. Various approaches have been proposed [2] for voice conversion.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…Voice conversion (VC) [1] aims to modify a speech signal uttered by a source speaker to sound as if it is uttered by a target speaker while retaining the linguistic information. Various approaches have been proposed [2] for voice conversion.…”
Section: Introductionmentioning
confidence: 99%
“…Various approaches have been proposed [2] for voice conversion. As parallel data [1] is expensive to collect, non-parallel methods [3,4,5] have received significant attention. Among them, phonetic posteriorgram (PPG) [5] based method is one of the most popular implementations.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Due to the extensive use of the esophageal voice by laryngectomees, this type of voice has been the subject of numerous studies in the last few years. To our knowledge, the existing approaches for ES quality improvements can be summarized into three categories: approaches based on the transformation of acoustic features, such as formant synthesis [4], comb filtering [5], and smoothing of acoustic parameters [6]; approaches based on statistical techniques, where [7][8][9] have been carried out, and approaches based on the VC technique, which allows for the transformation of the voice of a source speaker (laryngectomee) into that of a target speaker (laryngeal) [10][11][12][13][14][15][16]. Although these approaches have of course improved the estimation of the acoustic characteristics to reconstruct a converted signal with better quality, the improvements in intelligibility and naturalness are still insufficient.…”
Section: Introductionmentioning
confidence: 99%
“…For decades, the voice has attracted considerable attention from researchers. In speech processing, several areas emerge, such as spoken language recognition [13], automatic speech recognition [14], speaker verification [3], emotion recognition [22], speech understanding [11], voice transformation [21] or conversion [5]. Research efforts in this quite diverse list of areas share one common trait, in terms of the raw material being worked on: most focus on natural voice recordings -spontaneous or read speech, telephone recordings, or speech resulting from human-machine dialogues (through, for example, voice assistants).…”
Section: Introductionmentioning
confidence: 99%