2021
DOI: 10.1016/j.specom.2021.09.004
|View full text |Cite
|
Sign up to set email alerts
|

Increasing speech intelligibility and naturalness in noise based on concepts of modulation spectrum and modulation transfer function

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 28 publications
0
2
0
Order By: Relevance
“…In this work, we mainly focused on improving TTS speech intelligibility. In related work, it has been suggested that speech intelligibility and naturalness do not always imply each other [52], and thus improvement in intelligibility might not necessarily improve naturalness. In overall, our subjective evaluation results revealed that the proposed systems achieved a significant improvement in speech intelligibility while also preserving speech naturalness.…”
Section: E Subjective Evaluationmentioning
confidence: 99%
“…In this work, we mainly focused on improving TTS speech intelligibility. In related work, it has been suggested that speech intelligibility and naturalness do not always imply each other [52], and thus improvement in intelligibility might not necessarily improve naturalness. In overall, our subjective evaluation results revealed that the proposed systems achieved a significant improvement in speech intelligibility while also preserving speech naturalness.…”
Section: E Subjective Evaluationmentioning
confidence: 99%
“…The Lombard effect is known to impact the performance of speech recognition systems unfavorably. Various researchers have analyzed Lombard speech produced in different types and levels of noise for speech intelligibility [ 10 , 11 , 12 ], audio and audio-visual speech recognition [ 13 , 14 , 15 , 16 ], speaker recognition [ 17 , 18 , 19 ], and emotional speech analysis [ 20 ]. Overall, an automatic speech recognition system (ASR) performance may be degraded when Lombard speech is present in the speech signal [ 15 , 16 , 21 , 22 , 23 , 24 ].…”
Section: Introductionmentioning
confidence: 99%