2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721)
DOI: 10.1109/asru.2003.1318526
|View full text |Cite
|
Sign up to set email alerts
|

Transformation of speaker characteristics for voice conversion

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
10
0

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(10 citation statements)
references
References 4 publications
0
10
0
Order By: Relevance
“…All combinations between one female and two male speakers showed that higher formants were better converted than lower ones. Rentzos et al (2003) used the formant transformation by a two-dimensional phoneme-dependent hidden Markov models (HMM), glottal pulse LF model transformation, and pitch transformation based on time-domain pitch-synchronous overlapand-add (TD-PSOLA) method. Poles of the LPC model were used for formant estimation.…”
Section: Overview Of Voice Conversion Methodsmentioning
confidence: 99%
“…All combinations between one female and two male speakers showed that higher formants were better converted than lower ones. Rentzos et al (2003) used the formant transformation by a two-dimensional phoneme-dependent hidden Markov models (HMM), glottal pulse LF model transformation, and pitch transformation based on time-domain pitch-synchronous overlapand-add (TD-PSOLA) method. Poles of the LPC model were used for formant estimation.…”
Section: Overview Of Voice Conversion Methodsmentioning
confidence: 99%
“…Voice conversion (VC) is a technique used to modify paralinguistic factors of an utterance from a source speaker to sound like a target speaker. Para-linguistic factors include speaker identity [1], prosody [2] and accent [3], etc. In this paper, we focus on the conversion of speaker identity across arbitrary speakers under a one-shot scenario [4,5], i.e., given only one target speaker's utterance for reference.…”
Section: Introductionmentioning
confidence: 99%
“…Voice conversion (VC) is a task aimed at converting the speech signals from a certain acoustic domain to another while keeping the linguistic content the same. Examples of acoustic domains include not only speaker identity [1,2,3,4], but many other factors orthogonal to the linguistic content, such as speaking style, speaking rate [5], noise condition, emotion [6,7], and accent [8], with potential applications ranging from speech enhancement [9,10], computer-assisted pronunciation training for non-native language learner [8], speaking assistance [11], to name a few. This paper focuses on using VC to improve the speech intelligibility of surgical patients who have had parts of their articulators removed.…”
Section: Introductionmentioning
confidence: 99%