2013
DOI: 10.1007/978-3-319-01931-4_24
|View full text |Cite
|
Sign up to set email alerts
|

Improving Prosodic Break Detection in a Russian TTS System

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 12 publications
(5 citation statements)
references
References 6 publications
0
5
0
Order By: Relevance
“…This classifier was used in our previous work on predicting breaks in Russian. We compared this classifier to the CART classifier and found it to give a (slightly) more accurate result and to be more flexible and easier to tune than CART (see [8] for further details). Its disadvantage compared to CART is that it cannot be efficiently used for predicting break duration, since only break/nonbreak decisions are made; in this paper, we deal only with break positions and not break durations.…”
Section: The Random Forests Classifiermentioning
confidence: 99%
“…This classifier was used in our previous work on predicting breaks in Russian. We compared this classifier to the CART classifier and found it to give a (slightly) more accurate result and to be more flexible and easier to tune than CART (see [8] for further details). Its disadvantage compared to CART is that it cannot be efficiently used for predicting break duration, since only break/nonbreak decisions are made; in this paper, we deal only with break positions and not break durations.…”
Section: The Random Forests Classifiermentioning
confidence: 99%
“…To perform this we need a source audiobook (or similar) database comprising more than 20 hours of speech, where each sound file has a corresponding label file containing information about its speech elements [2]. First of all, linguistic and acoustic features [3,8] are calculated for the audiobook database. Then, voice model building is performed.…”
Section: The Proposed Systemmentioning
confidence: 99%
“…Then linguistic and prosodic features are calculated [3,8]. At the next step, HMM prototypes for each speech element in the database are created.…”
Section: Database Preprocessingmentioning
confidence: 99%
“…• sum of products of multiplication of the subband number (1)(2)(3)(4)(5)(6)(7)(8)(9)(10)(11)(12) and place in the ranking of the subbands energy, in descending order (subb rank score, 82);…”
Section: Frequencies In Subbandsmentioning
confidence: 99%