Interspeech 2017 2017
DOI: 10.21437/interspeech.2017-400
|View full text |Cite
|
Sign up to set email alerts
|

Speaking Style Conversion from Normal to Lombard Speech Using a Glottal Vocoder and Bayesian GMMs

Abstract: Speaking style conversion is the technology of converting natural speech signals from one style to another. In this study, we focus on normal-to-Lombard conversion. This can be used, for example, to enhance the intelligibility of speech in noisy environments. We propose a parametric approach that uses a vocoder to extract speech features. These features are mapped using Bayesian GMMs from utterances spoken in normal style to the corresponding features of Lombard speech. Finally, the mapped features are convert… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
12
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
3
3
1

Relationship

3
4

Authors

Journals

citations
Cited by 17 publications
(13 citation statements)
references
References 21 publications
0
12
0
Order By: Relevance
“…In the baseline system with parallel data, a standard GMM is used as it was shown to compare well against DNNs and non-parametric Bayesian methods in an earlier study with the present data set [14].…”
Section: Parallel Gmm Learningmentioning
confidence: 87%
See 3 more Smart Citations
“…In the baseline system with parallel data, a standard GMM is used as it was shown to compare well against DNNs and non-parametric Bayesian methods in an earlier study with the present data set [14].…”
Section: Parallel Gmm Learningmentioning
confidence: 87%
“…Moreover, the amount of parallel training data, where the utterances in the source and target styles are from the same speaker speaking the same linguistic content, is limited. Our earlier work [14,15] also suggest that the limited availability of parallel data in normal and Lombard styles causes a bottleneck in system performance. This encourages the use of nonparallel mapping models within the parametric SSC framework.…”
Section: Introductionmentioning
confidence: 92%
See 2 more Smart Citations
“…SSC has been previously studied in whisper-to-normal conversion [3][4][5] and in normal-to-Lombard conversion [6][7][8]. In addition, a parametric approach to normal-to-Lombard SSC was recently explored in [9], where a vocoder was used to extract frame level features that were then transformed from normal to Lombard style using parallel data-driven mapping models, and then synthesized as speech in the target style using the same vocoder.…”
Section: Introductionmentioning
confidence: 99%