2020
DOI: 10.1109/access.2020.3015421
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement

Abstract: The Lombard effect is one of the most well-known effects of noise on speech production. Speech with the Lombard effect is more easily recognizable in noisy environments than normal natural speech. Our previous investigations showed that speech synthesis models might retain Lombard-effect characteristics. In this study, we investigate several speech models, such as harmonic, source-filter, and sinusoidal, applied to Lombard speech in the context of speech enhancement. For this purpose, 100 utterances of natural… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 12 publications
(5 citation statements)
references
References 42 publications
0
4
0
Order By: Relevance
“…Since the discovery of LE, this phenomenon has been extensively studied by a wide range of specialists to find solutions to improve the performance of automatic speech recognition systems in noisy environments (Maheswari et al, 2021) or increase speech intelligibility by converting the speaking style from normal to Lombard speech (Li et al, 2020, Kąkol et al, 2020. Also, the basic idea was that LE might be applied to speech synthesizers, allowing them to adapt to noisy conditions , Paul et al, 2020.…”
Section: Introductionmentioning
confidence: 99%
“…Since the discovery of LE, this phenomenon has been extensively studied by a wide range of specialists to find solutions to improve the performance of automatic speech recognition systems in noisy environments (Maheswari et al, 2021) or increase speech intelligibility by converting the speaking style from normal to Lombard speech (Li et al, 2020, Kąkol et al, 2020. Also, the basic idea was that LE might be applied to speech synthesizers, allowing them to adapt to noisy conditions , Paul et al, 2020.…”
Section: Introductionmentioning
confidence: 99%
“…In contrast, the LE may create problems when detecting speech in noise automatically, but not trained on data related to the LE (Vlaj and Kacic, 2011;Marxer et al, 2018;Korvel et al, 2020;Maheswari et al, 2020). Such hyper-articulation impairs the performance of the speech recognition systems (Maheswari et al, 2020), so it is essential to train them on data, including Lombard-related features.…”
Section: Introductionmentioning
confidence: 99%
“…However, it should be remembered that the best judge of speech intelligibility is the human ear. Therefore, there are many attempts to find a correlation between objective measurement results and subjective evaluation [5][6][7][8][9]. In general, the STI value can be determined using two ways, i.e., the direct method based on modulated signals or the indirect method based on the impulse response, according to IEC 60268-16 standard [10].…”
Section: Introductionmentioning
confidence: 99%