2016
DOI: 10.1007/s10772-016-9386-9
|View full text |Cite
|
Sign up to set email alerts
|

Modification of energy spectra, epoch parameters and prosody for emotion conversion in speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
7
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(7 citation statements)
references
References 24 publications
0
7
0
Order By: Relevance
“…Accordingly, a subjective evaluation was performed using the comparison mean opinion score (CMOS) of the evaluation measures, and speaker similarity. CMOS/MOS tests have been used for drawing similarities between the synthesized and target emotions [13], [15], [17], [33], [57], [84]- [86], while speaker-similarity scores provide the extent to which the identity of a speaker is preserved after conversion. The ranking scales used for estimating CMOS and speaker similarity are explained in Tables.…”
Section: B Subjective Measuresmentioning
confidence: 99%
“…Accordingly, a subjective evaluation was performed using the comparison mean opinion score (CMOS) of the evaluation measures, and speaker similarity. CMOS/MOS tests have been used for drawing similarities between the synthesized and target emotions [13], [15], [17], [33], [57], [84]- [86], while speaker-similarity scores provide the extent to which the identity of a speaker is preserved after conversion. The ranking scales used for estimating CMOS and speaker similarity are explained in Tables.…”
Section: B Subjective Measuresmentioning
confidence: 99%
“…Subjective evaluations have been conducted using the comparative mean opinion score (CMOS) in this work. A CMOS test is conducted for evaluating the similarity of the synthesized speech in relation to the target emotion [22], [24], [26].…”
Section: B Subjective Measuresmentioning
confidence: 99%
“…RB approach is relatively simple and direct when compared to other methods, the rules employed decides the naturalness and quality of the emotional speech. The rule-based approaches have been used in English [10], Dutch [11], Spanish [12], Catalan [13], German [14], Korean [15] and some Indian languages [16,17,18].…”
Section: Modification Of Prosody For Emotion Conversion Using Gaussian Regression Modelmentioning
confidence: 99%
“…Pathak [34] has used Discrete Wavelet Transform (DWT) for modeling emotional speech of Source and Target speakers. Haque [17] has used spectral energy, epoch strength and epoch sharpness with Pitch and intensity. Filter bank approach was used to modify energy spectra and the pitch contour of target emotion was predicted using Gaussian Normalization and polynomial regression method.…”
Section: Modification Of Prosody For Emotion Conversion Using Gaussian Regression Modelmentioning
confidence: 99%
See 1 more Smart Citation