2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings
DOI: 10.1109/icassp.2006.1659961
|View full text |Cite
|
Sign up to set email alerts
|

HSMM-Based Model Adaptation Algorithms for Average-Voice-Based Speech Synthesis

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 18 publications
(11 citation statements)
references
References 10 publications
0
11
0
Order By: Relevance
“…The procedure of warping factor generation by GMM-based method is illustrated in Figure 3. (3) where A is the transform matrix to estimate, B is the bias vector, is original model mean and is the adapted mean. The transform matrix A is estimated by maximizing likelihood of adaptation data O from target speaker,…”
Section: Frequency Warping For Speaker Adaptation Of Speech Synthmentioning
confidence: 99%
See 1 more Smart Citation
“…The procedure of warping factor generation by GMM-based method is illustrated in Figure 3. (3) where A is the transform matrix to estimate, B is the bias vector, is original model mean and is the adapted mean. The transform matrix A is estimated by maximizing likelihood of adaptation data O from target speaker,…”
Section: Frequency Warping For Speaker Adaptation Of Speech Synthmentioning
confidence: 99%
“…Voice transformation is generally used to convert the speech synthesized by unit selection based waveform concatenation TTS system. Speaker adaption adjusts the parameters of Maximum a posterior (MAP), maximum likelihood linear regression (MLLR) and speaker adaptive training (SAT), which are originally developed for automatic speech recognition (ASR), are applied to change the voice characteristics of the speech generated by statistical parametric speech synthesis systems [3,4].…”
Section: Introductionmentioning
confidence: 99%
“…Recently, voice conversion in the framework of hidden Markov model based speech synthesis has also become a popular topic (e.g. [5]). …”
Section: Introductionmentioning
confidence: 99%
“…Several adaptation algorithms have been borrowed from speech recognition and further developed [10] for HMM-based speech synthesis. Since the purpose of speaker adaptation for speech synthesis is different from that for speech recognition, a speech synthesis-specific adaptation algorithm, called Minimum Generation Error Linear Regression (MGELR), has also been proposed [13].…”
Section: From Intra-lingual To Crossmentioning
confidence: 99%
“…This is achieved by modifying the HMM parameters using model adaptation technique. Several model adaptation algorithms, which were originally proposed for speech recognition, including Maximum a Posteriori (MAP), Maximum Likelihood Linear Regression (MLLR) [7], Constrained MLLR (CMLLR) [8], and so on, have been applied to HMM-based speech synthesis [9,10]. It has been demonstrated that speaker adaptation of an "Average Voice" model [11] is superior to speaker adaptation of a speaker-dependent model.…”
Section: Introductionmentioning
confidence: 99%