Robust Speech Recognition of Uncertain or Missing Data 2011
DOI: 10.1007/978-3-642-21317-5_7
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Speech Recognition Using Missing Data Techniques: Handling of Real-World Data

Abstract: The use of general descriptive names, registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2012
2012
2012
2012

Publication Types

Select...
2
2
1

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(6 citation statements)
references
References 23 publications
0
6
0
Order By: Relevance
“…First, while it has been shown MDT can be used to combat reverberation [16,25], to date no method has been presented that enables the estimation of reverberation-dominated features in noisy environments. Second, future work will address the poor performance of current mask estimation methods on speech corrupted by background music, a prevailing problem in searching audion archives.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…First, while it has been shown MDT can be used to combat reverberation [16,25], to date no method has been presented that enables the estimation of reverberation-dominated features in noisy environments. Second, future work will address the poor performance of current mask estimation methods on speech corrupted by background music, a prevailing problem in searching audion archives.…”
Section: Discussionmentioning
confidence: 99%
“…All speech material was simultaneously recorded with four different microphones (channels) at increasing distances, resulting in utterances corrupted by varying levels of noise. We used the method described in [16] to obtain SNR estimates of all utterances.…”
Section: Real-world Data: the Speecon And Speechdat-car Databasesmentioning
confidence: 99%
See 2 more Smart Citations
“…In order to show the speed improvement of MC MDT over a full MDT system [45], i.e., where the CLSQ problem (11) is solved per Gaussian with GD, an acoustic model with Gaussians estimated on PROSPECT features is required. The model has 21,037 PROSPECT Gaussians which are obtained by Single Pass Retraining (SPR) [46] of the acoustic model with MIDA features.…”
Section: Training Backend Prospect Modelmentioning
confidence: 99%