2013 IEEE International Conference on Acoustics, Speech and Signal Processing 2013
DOI: 10.1109/icassp.2013.6639216
|View full text |Cite
|
Sign up to set email alerts
|

Prediction of creaky voice from contextual factors

Abstract: Creaky voice, also referred to as vocal fry, is a voice quality frequently produced in many languages, in both read and conversational speech. In order to enhance the naturalness of speech synthesisers, these latter should be able to generate speech in all its expressive diversity. This includes a proper use of creaky voice. The goal of this paper is two-fold. Firstly we analyse how contextual factors can be informative for the prediction of creaky use. It is observed that a few contextual factors related to s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2013
2013
2014
2014

Publication Types

Select...
2
1

Relationship

2
1

Authors

Journals

citations
Cited by 3 publications
(7 citation statements)
references
References 15 publications
0
7
0
Order By: Relevance
“…Creaky voice has been studied in relation to various functions in speech communication, and most commonly with phrase or sentence boundary marking (Surana and Slifka, 2006b;Drugman et al, 2013). Similarly creaky voice has been associated with turn-yielding in Finnish (Ogden, 2001).…”
Section: Creaky Voice In Speech Communicationmentioning
confidence: 99%
“…Creaky voice has been studied in relation to various functions in speech communication, and most commonly with phrase or sentence boundary marking (Surana and Slifka, 2006b;Drugman et al, 2013). Similarly creaky voice has been associated with turn-yielding in Finnish (Ogden, 2001).…”
Section: Creaky Voice In Speech Communicationmentioning
confidence: 99%
“…Synthesis of voice with creak requires i) the prediction of creaky parts from context and ii) the ability to render creaky excitation. In our previous work, we have developed methods for creaky voice prediction from context [13] and rendering of creaky excitation [12]. However, these methods have not been utilised in a full TTS voice before.…”
Section: Synthesis Of Creaky Voicementioning
confidence: 99%
“…In this study, the algorithm in [5] is used, which provides a frame-wise probability of creak. This parameter is used as a feature in the HMMtraining for determining if a segment is creaky or not [13]. More specifically, the parameter indicating the probability of creak is trained as an additional 1-dimensional feature along with other speech features, such as F0 and spectrum.…”
Section: Prediction Of Creaky Voice From Contextmentioning
confidence: 99%
See 2 more Smart Citations