IEEE International Conference on Acoustics Speech and Signal Processing 2002
DOI: 10.1109/icassp.2002.1005676
|View full text |Cite
|
Sign up to set email alerts
|

Missing data speech recognition in reverberant conditions

Abstract: In this study we describe an auditory processing front-end for missing data speech recognition, which is robust in the presence of reverberation. The model attempts to identify time-frequency regions that are not badly contaminated by reverberation and have strong speech energy. This is achieved by applying reverberation masking. Subsequently, reliable time-frequency regions are passed to a 'missing data' speech recogniser for classification. We demonstrate that the model improves recognition performance in th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2002
2002
2012
2012

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 6 publications
0
4
0
Order By: Relevance
“…Is perception governed by the less distorted, 0.32-m bands in these sounds? This might happen if hearing behaves like a "missing data" speech recogniser, and bases its decisions on the less distorted parts of the signal (Palomäki, Brown and Barker 2002). Clearly, this could not be happening on a word by word basis with the listeners in this experiment, as there would be no effects of distance when only 4 of the test-word"s bands are given the 10-m reflection patterns.…”
Section: Ish 2009mentioning
confidence: 91%
“…Is perception governed by the less distorted, 0.32-m bands in these sounds? This might happen if hearing behaves like a "missing data" speech recogniser, and bases its decisions on the less distorted parts of the signal (Palomäki, Brown and Barker 2002). Clearly, this could not be happening on a word by word basis with the listeners in this experiment, as there would be no effects of distance when only 4 of the test-word"s bands are given the 10-m reflection patterns.…”
Section: Ish 2009mentioning
confidence: 91%
“…Therefore, our future work consists of experimenting with multi-channel acoustic models in the MDT framework. Also, we will investigate the effectiveness of measures against reverberation [16].…”
Section: Discussionmentioning
confidence: 99%
“…• It is assumed that the speech signal is disturbed by additive possibly nonstationary background noise. Significant reverberation cannot be handled, though mask estimation methods to handle reverberated speech are described in the literature [18,19]. If the user wants robustness against reverberation, he will need to implement his own mask estimation technique.…”
Section: Extension To a Missing Data Theory Based Recognisermentioning
confidence: 99%