Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
DOI: 10.1109/icassp.2005.1415104
|View full text |Cite
|
Sign up to set email alerts
|

Additive Modeling of English F0 Contour for Speech Synthesis

Abstract: In this paper, we present an approach to fundamental frequency contour modeling of English for speech synthesis, based on a statistical learning technique called Additive Models that was successfully applied to the modeling of Japanese F0 contour previously.In an attempt to model English F0 contour, we defined a threelayer additive model consisting of an intonational phrase component, a word-level component representing lexical stress types, and a pitch-accent component related to accented syllables. These com… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
17
0

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 15 publications
(23 citation statements)
references
References 18 publications
1
17
0
Order By: Relevance
“…Prediction errors showed in table 1 indicate that the TTS results obtained using our clustering technique are comparable with other approaches found in the bibliography (see [14] for a ranking). Informal listening tests have been done to assess the goodness of the synthetic intonation.…”
Section: Resultssupporting
confidence: 64%
“…Prediction errors showed in table 1 indicate that the TTS results obtained using our clustering technique are comparable with other approaches found in the bibliography (see [14] for a ranking). Informal listening tests have been done to assess the goodness of the synthetic intonation.…”
Section: Resultssupporting
confidence: 64%
“…Fundamental frequency (F0) is the most important acoustic correlate of tone in spoken Mandarin. There have been numerous studies on F0 modeling [1][2][3][4][5][6][7][8]. These studies are roughly around three issues.…”
Section: Introductionmentioning
confidence: 99%
“…Furthermore, in this structure, each training data sample contributes to modeling multiple mean vectors and covariance matrices. Many papers applied the additive structure just for F0 modeling [37][38][39][40]. Authors in [37] proposed an additive structure with multiple decision trees for mean vectors and a single tree for variance terms.…”
Section: Introductionmentioning
confidence: 99%
“…Acoustic modeling with contextual additive structure has also been proposed to represent dependencies between contextual factors and acoustic features more precisely [19,20,23,32,[36][37][38][39][40]. In this structure, acoustic trajectories are considered to be a sum of independent acoustic components which have different context dependencies (different decision trees have to be trained for those components).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation