2013
DOI: 10.1016/j.csl.2012.06.003
|View full text |Cite
|
Sign up to set email alerts
|

Analysis of the visual Lombard effect and automatic recognition experiments

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
8
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(9 citation statements)
references
References 10 publications
0
8
0
Order By: Relevance
“…The training and the evaluation of the systems are usually performed with speech recorded in quiet and afterwards degraded with additive noise. Previous work shows that speaker (Hansen and Varadarajan, 2009) and speech recognition (Junqua, 1993) systems that ignore Lombard effect achieve sub-optimal performance, also in visual (Heracleous et al, 2013;Marxer et al, 2018) and audiovisual settings (Heracleous et al, 2013). It is therefore of interest to conduct a similar study also in a SE context.…”
Section: Introductionmentioning
confidence: 98%
“…The training and the evaluation of the systems are usually performed with speech recorded in quiet and afterwards degraded with additive noise. Previous work shows that speaker (Hansen and Varadarajan, 2009) and speech recognition (Junqua, 1993) systems that ignore Lombard effect achieve sub-optimal performance, also in visual (Heracleous et al, 2013;Marxer et al, 2018) and audiovisual settings (Heracleous et al, 2013). It is therefore of interest to conduct a similar study also in a SE context.…”
Section: Introductionmentioning
confidence: 98%
“…The mismatch between the neutral and the Lombard speaking styles can lead to sub-optimal performance of audio-only-based speaker [15] and speech recognition [2] systems. Only a few works investigate the impact of the Lombard effect on visual [16,17] and audio-visual [16] automatic speech recognition, but, to the best knowledge of the authors, no studies have been conducted for AV-SE systems.…”
Section: Introductionmentioning
confidence: 99%
“…Finally, we report results on sentence-level speech recognition. This is in contrast to previous works which mainly focus either on isolated words [16] or on specific words within a sentence [13]. We believe that the conclusions reached by this approach can be more useful for a practical speech recognition system where the goal will most likely be to recognise all words in a sentence rather than recognise just isolated words.…”
Section: Introductionmentioning
confidence: 81%
“…As expected the improvement is higher when visual Lombard speech is used for training. On the other hand, Heracleous et al [16] reported a performance drop when there is a mismatch between training and testing conditions. The same conclusion was also reached when an audio-visual speech recognition system was used.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation