2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2012
DOI: 10.1109/icassp.2012.6288968
|View full text |Cite
|
Sign up to set email alerts
|

A comparison of front-end compensation strategies for robust LVCSR under room reverberation and increased vocal effort

Abstract: Automatic speech recognition is known to deteriorate in the presence of room reverberation and variation of vocal effort in speakers. This study considers robustness of several state-of-the-art front-end feature extraction and normalization strategies to these sources of speech signal variability in the context of large vocabulary continuous speech recognition (LVCSR). A speech database recorded in an anechoic room, capturing modal speech and speech produced at different levels of vocal effort, is reverberated… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2013
2013
2016
2016

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 20 publications
0
3
0
Order By: Relevance
“…Mean Hilbert envelope coefficients (MHECs) were recently proposed for noise robust speech, speaker, and language recognition [61,66]. It uses the output of each filter in the filterbank.…”
Section: Mean Hilbert Envelope Coefficients (Mhecs)mentioning
confidence: 99%
“…Mean Hilbert envelope coefficients (MHECs) were recently proposed for noise robust speech, speaker, and language recognition [61,66]. It uses the output of each filter in the filterbank.…”
Section: Mean Hilbert Envelope Coefficients (Mhecs)mentioning
confidence: 99%
“…In recent years, researchers have been putting a great deal of effort into the development of speech processing algorithms that would maintain good performance in real world conditions. Besides speaker/channel variability and room reverberation [19,21], environmental noise represents one of the most disruptive and hard to deal with factors [22]. Successful modeling and suppression of noise effects in speech engines requires availability of noisy speech data.…”
Section: Communication In Noisementioning
confidence: 99%
“…Environmental -background noise [16] (stationary, impulsive, time-varying, etc. ), room acoustics [17], reverberation [18,19], distant microphone. Data quality -duration, sampling rate, recording quality, audio codec/compression [20].…”
Section: Introductionmentioning
confidence: 99%