2000
DOI: 10.1109/89.876308
|View full text |Cite
|
Sign up to set email alerts
|

Rapid speaker adaptation in eigenvoice space

Abstract: This paper describes a new model-based speaker adaptation algorithm called the eigenvoice approach. The approach constrains the adapted model to be a linear combination of a small number of basis vectors obtained offline from a set of reference speakers, and thus greatly reduces the number of free parameters to be estimated from adaptation data. These "eigenvoice" basis vectors are orthogonal to each other and guaranteed to represent the most important components of variation between the reference speakers. Ex… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
278
0
1

Year Published

2002
2002
2013
2013

Publication Types

Select...
7
3

Relationship

0
10

Authors

Journals

citations
Cited by 421 publications
(286 citation statements)
references
References 36 publications
2
278
0
1
Order By: Relevance
“…Adaptation techniques such as MAP [24], MLLR [25], and eigenspacebased techniques [26] are often used to solve this problem. Although eigenspace-based techniques are effective when adaptation data are extremely small [38], they restrict the model to a lower dimensionality where much information might be lost [39]. On the other hand, MLLR and MAP do not impose this restriction on the models.…”
Section: Statistical Modeling Based On Gaussian Mixture Modelsmentioning
confidence: 99%
“…Adaptation techniques such as MAP [24], MLLR [25], and eigenspacebased techniques [26] are often used to solve this problem. Although eigenspace-based techniques are effective when adaptation data are extremely small [38], they restrict the model to a lower dimensionality where much information might be lost [39]. On the other hand, MLLR and MAP do not impose this restriction on the models.…”
Section: Statistical Modeling Based On Gaussian Mixture Modelsmentioning
confidence: 99%
“…However, if we increase the number of representative HMM sets to enhance the capabilities of representation, it is difficult to determine the interpolation ratio to obtain the required voice. To address this problem, Shichiri et al applied the eigenvoice technique (Kuhn et al, 2000) to HMM-based speech synthesis (Shichiri et al, 2002). A speaker-specific "super-vector" was composed by concatenating the mean vectors of all state-output distributions in the model set for each S speaker-dependent HMM set.…”
Section: Eigenvoice (Producing Voices)mentioning
confidence: 99%
“…This method achieves efficient adaptation. Adaptation techniques which only need a small amount of target speech data, such as those using inter-speaker variation modeling like Eigenvoice [5], have also been proposed. In this framework, the super vectors of the mean parameters of the speaker-dependent acoustic models are used as bases, and the super vector of the new speakerspecific acoustic models is expressed as a linear combination of these bases.…”
Section: Introductionmentioning
confidence: 99%