1999
DOI: 10.1016/s0167-6393(99)00008-4

Rules for the generation of ToBI-based American English intonation

Cited by 28 publications (10 citation statements)
References 20 publications (13 reference statements)
“…Previously observed interaction between focus and sentence modality in terms of surface F0 contours (Cooper et al, 1986; Pell, 2001; Xu and Xu, 2005) is successfully simulated using only 26 sets of categorical parameters representing four functional layers: stress, focus, syllable position and sentence modality. Compared to previous attempts to model English intonation (Jilka et al, 1999; Grabe et al, 2007; Taylor, 2000), the present results show both accurate F0 contours and high generalizability, as the learned parameters are directly related to communicative functions. Bold-face indicates a focus placement and underline indicates a stress syllable of that word.…”
Section: English
Type: mentioning
confidence: 44%
“…RMSE indicates the average mismatch of the contours while correlation indicates the mismatch between the shape and the alignment of the contours. These two measurements have been shown to be effective (Hermes, 1998), and have been widely used as computational metrics in previous prosody modeling works (Black and Hunt, 1996; Jilka et al, 1999; Prom-on et al, 2009, 2011, 2012; Ross and Ostendorf, 1999; Taylor, 2000).…”
Section: Testing Methods
Type: mentioning
confidence: 99%
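The two metrics named in the statement above can be illustrated with a minimal sketch. The function names `rmse` and `pearson_r` and the F0 values are hypothetical and chosen only for illustration; they do not come from any of the cited papers. The sketch assumes both contours are sampled at the same time points.

```python
import math


def rmse(reference, predicted):
    """Root-mean-square error between two equally sampled F0 contours."""
    if len(reference) != len(predicted) or not reference:
        raise ValueError("contours must be non-empty and of equal length")
    squared = [(r - p) ** 2 for r, p in zip(reference, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def pearson_r(reference, predicted):
    """Pearson correlation between two equally sampled F0 contours."""
    n = len(reference)
    if n != len(predicted) or n < 2:
        raise ValueError("contours must have equal length of at least 2")
    mean_r = sum(reference) / n
    mean_p = sum(predicted) / n
    cov = sum((r - mean_r) * (p - mean_p) for r, p in zip(reference, predicted))
    var_r = sum((r - mean_r) ** 2 for r in reference)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    if var_r == 0 or var_p == 0:
        raise ValueError("correlation is undefined for a flat contour")
    return cov / math.sqrt(var_r * var_p)


# Made-up F0 samples in Hz: the prediction has the same shape as the
# reference but sits a constant 3 Hz higher.
reference_f0 = [100.0, 110.0, 120.0, 115.0]
predicted_f0 = [103.0, 113.0, 123.0, 118.0]

error = rmse(reference_f0, predicted_f0)
shape = pearson_r(reference_f0, predicted_f0)
```

A constant offset raises the RMSE while leaving the correlation at its maximum, which is one reason the two measures are typically reported together.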
“…We add a final traversal of utterance's phonetic representation so that the server can output a series of visemes and animation commands corresponding to a synthesized waveform. For RUTH, we have also reinstrumented Festival (debugging and extending the standard release) to control pitch by annotation [28,29]; we use OGI CSLU synthesis and voices [30]. Animation schedules and speech waveforms output by Festival can be saved, reused and modified directly.…”
Section: Interfacing With Speech
Type: mentioning
confidence: 99%
“…In terms of the development method, the models are constructed based on a rule-based approach (Anderson, Pierrehumbert & Liberman, 1984; Allen, Hunnicutt & Klatt, 1987; Jilka, Mohler & Dogil, 1999) or a corpus-based one (Traber, 1992; Manna & Quazza, 1995; Black & Hunt, 1996; Ross & Ostendorf, 1999). The model parameters in the rule-based approach are given by designers through a considerable effort of trial-and-error, while those in the corpus-based one are obtained in a statistical way.…”
Section: Introduction
Type: mentioning
confidence: 99%