Analysis of voice fundamental frequency contours for declarative sentences of Japanese.

Fujisaki, Hiroya; Hirose, Keikichi

doi:10.1250/ast.5.233

Cited by 352 publications

(215 citation statements)

References 2 publications

Supporting

Mentioning

210

Contrasting

Unclassified

Order By: Relevance

“…The interaction between focus-type information and expressive-type information has been examined by researchers such as Menezes [50], who examined articulatory X-ray microbeam data and found an effect on jaw opening and phrasing as a function of the interaction between contrastive emphasis and irritation. Models of intonation such as the ones by e.g., Fujisaki and Hirose [55], Mozziconacci [56] or t'Hart et al [57] have been proposed to account for the way that intonation is used to convey expressivity in speech. However, a comprehensive model that incorporates not only the intonational aspects, but also the rhythm aspects, as well as the linguistic and sociolinguistc aspects of expressive speech has yet to be developed [58].…”

Section: Future Workmentioning

confidence: 99%

“…The rhythm and stress are represented by a temporal sequence of syllable triangles, the height of which is derived from jaw opening measurements, and the angles of the triangles are calculated in a simple fashion such that all angles are the same, unless the syllable types (such as syllable weight and the use of certain types of onset/coda characteristics) are different and the corresponding gaps between the base of the triangles indicate the magnitudes of the phonetic phrase boundaries. The F 0 values associated with each syllable of the utterance are connected in a way similar to the Fujisaki model (e.g., [55]) to show the intonation contour of the utterance. The study shows that for expressive speech, the stress pattern represented by syllable and boundary magnitudes often works independently from intonation, e.g., for suspicion, the prominent syllable (/won/) actually has the lowest F 0 value.…”

Section: Future Workmentioning

confidence: 99%

See 1 more Smart Citation

Expressive speech: Production, perception and application to speech synthesis

Erickson

2005

Acoust. Sci. & Tech.

View full text Add to dashboard Cite

This paper reviews some of the recent issues and findings in the area of production and perception of expressive speech and the application to speech synthesis. Specifically, it discusses some of the current problems with data collection, labeling, techniques for analyzing voice quality and applying speech synthesis as an analysis tool. Directions for future work in order to improve synthesis of expressive speech are suggested along the lines of better modeling, labeling and voice quality analysis.

show abstract

Section: Future Workmentioning

confidence: 99%

Section: Future Workmentioning

confidence: 99%

Expressive speech: Production, perception and application to speech synthesis

Erickson

2005

Acoust. Sci. & Tech.

View full text Add to dashboard Cite

show abstract

“…In particular, it is important to differentiate what TCoG is meant to accomplish from what other existing models of contour shape both aim at and succeed in doing. A pproaches such as the Fujisaki model (Fujisaki and Hirose 1984), Momel (Hirst and Espesser 1993), and Tilt (Taylor 2000), for example, all encode contour shape in one form or another, but differ from TCoG in important ways. The Fujisaki model and Momel are concerned with representing, with as little deviation as possible, the precise shape of each individual F 0 contour.…”

Section: Tonal Center Of Gravity 371mentioning

confidence: 99%

Tonal Center of Gravity: A global approach to tonal implementation in a level-based intonational phonology

Barnes

Veilleux

Brugos

et al. 2012

Laboratory Phonology

View full text Add to dashboard Cite

show abstract

“…However, given a speech synthesis system without specific focus control, it is not efficient to prepare a large speech corpus with focus control and train the speech synthesis system from the beginning. While we have developed a corpus-based method of synthesizing F0 contours in the framework of the generation process model (F0 model) [1] and realized speech synthesis in reading and dialogue styles with various emotions [2,3].…”

Section: Introductionmentioning

confidence: 99%

Control of prosodic focus in corpus-based generation of fundamental frequency contours of Japanese based on the generation process model

Ochi

Hirose

Minematsu

2009

2009 IEEE International Conference on Acoustics, Speech and Signal Processing

Self Cite

View full text Add to dashboard Cite

A method was developed for generating sentence F0 contours, when a focus is placed in one of bunsetsu of an utterance. The method is to predict differences in F0 model commands between with and without focus utterances, and applies them to the F0 model commands predicted beforehand by the baseline method. The validity of the method was proved by the experiment on F0 contour generation and speech synthesis.

show abstract

Analysis of voice fundamental frequency contours for declarative sentences of Japanese.

Cited by 352 publications

References 2 publications

Expressive speech: Production, perception and application to speech synthesis

Expressive speech: Production, perception and application to speech synthesis

Tonal Center of Gravity: A global approach to tonal implementation in a level-based intonational phonology

Control of prosodic focus in corpus-based generation of fundamental frequency contours of Japanese based on the generation process model

Contact Info

Product

Resources

About