2018
DOI: 10.1016/j.specom.2018.04.006

The impact of the Lombard effect on audio and visual speech recognition systems

Cited by 23 publications (37 citation statements)
References 33 publications
“…This scenario allows to take into account the two factors responsible for the Lombard adaptation: first, speakers tend to regulate their vocal effort based on the auditory feedback, i.e. they involuntarily react to the perceived level of their own speech [17]; secondly, they change their speaking style to communicate better with others [20,21]. The impact of the noise type on the Lombard effect is currently unclear.…”
Section: Audio-visual Corpus and Noise Data (mentioning)
confidence: 99%
“…2. In the Lombard GRID database, the energy difference between Lombard and neutral utterances is between 3 dB and 13 dB [17]. If we assume that the listener is immersed in SSN at 80 dB SPL, like in the recording conditions of the Lombard GRID database, and that the conversational speech level is between 60 and 70 dB SPL [29,30], the SNR is between -17 dB and 3 dB.…”
Section: Additive Noise Levels (mentioning)
confidence: 99%
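The SNR range quoted in the statement above can be checked with a few lines of arithmetic. The sketch below assumes that the Lombard energy gain (3 dB to 13 dB) is simply added to the conversational speech level (60 to 70 dB SPL) before subtracting the 80 dB SPL noise level; this is one reading of the quoted passage, not a method confirmed by the citing paper. The function name `snr_range_db` and its parameters are hypothetical and chosen only for illustration.

```python
def snr_range_db(speech_min, speech_max, gain_min, gain_max, noise_level):
    """Return the (lowest, highest) SNR in dB.

    Assumption: Lombard speech level = conversational level + Lombard gain,
    and SNR = speech level - noise level, all in dB.
    """
    lowest = speech_min + gain_min - noise_level
    highest = speech_max + gain_max - noise_level
    return lowest, highest


# Values taken from the quoted statement:
# speech at 60 to 70 dB SPL, Lombard gain of 3 to 13 dB, SSN at 80 dB SPL.
low, high = snr_range_db(60, 70, 3, 13, 80)
print(low, high)  # -17 3
```

Under these assumptions the bounds come out at -17 dB and 3 dB, which matches the range given in the statement.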
“…In this way, the noise power is reduced, speech distortions are minimized, and the localization information of the desired source is preserved. However, as an adverse effect, the MWF distorts the binaural cues of the noise, causing the perceived location of the noise to be the same as that of the source associated with the speech [11]. To control the undesired effect that the MWF has on the localization cues of the residual noise, recent noise reduction methods add to the original MWF cost function an additional penalty term on the distortions (Diego M. do Carmo, Ricardo A. Borsoi and Márcio H. Costa, Universidade Federal de Santa Catarina.)…”
Section: Introduction (unclassified)
“…Most of both research and application areas are, however, related to human (and human-computer) communication, telecommunications, etc. [9], [19], [20]. Especially important are strategies for improving speech comprehensibility in noisy conditions based on various techniques, including speech modeling.…”
Section: Introduction (mentioning)
confidence: 99%