2009 IEEE International Conference on Acoustics, Speech and Signal Processing 2009
DOI: 10.1109/icassp.2009.4960485
|View full text |Cite
|
Sign up to set email alerts
|

Non-linear mapping for multi-channel speech separation and robust overlapping spech recognition

Abstract: This paper investigates a non-linear mapping approach to extract robust features for ASR and speech separation of overlapping speech. Based on our previous studies, we continue to use two additional sound sources, namely from the target and interfering speakers. The focuses of this work are: 1) We investigate the feature mapping between different domains with the consideration of MMSE criterion and regression optimizations, demonstrating the mapping of log melfilterbank energies to MFCC can be exploited to imp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2014
2014
2016
2016

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 12 publications
(13 reference statements)
0
3
0
Order By: Relevance
“…The mapping is performed between features extracted from noisy and clean speech signals to obtain an optimal set of parameters through the error backpropagation algorithm [52,53,54]. The goal is to obtain clean or enhanced speech from the noisy input via a nonlinear transformation using neural networks such as a deep denoising autoencoder [55] or a multilayer perceptron (MLP) [56].…”
Section: Feature Mapping Techniques Using Dnnmentioning
confidence: 99%
“…The mapping is performed between features extracted from noisy and clean speech signals to obtain an optimal set of parameters through the error backpropagation algorithm [52,53,54]. The goal is to obtain clean or enhanced speech from the noisy input via a nonlinear transformation using neural networks such as a deep denoising autoencoder [55] or a multilayer perceptron (MLP) [56].…”
Section: Feature Mapping Techniques Using Dnnmentioning
confidence: 99%
“…In the Pascal Speech Separation Challenge [6], recognizing a target speech in the presence of another talker's speech was evaluated in a monaural scenario. A multichannel approach has also been studied [7]- [11] because it is more effective for speech separation.…”
Section: Introductionmentioning
confidence: 99%
“…recognizing speech from multiple distant microphones (multichannel) for multiparty meetings where more than one speaker can be active at the same time). The basic idea [210]- [211] to achieve this is to find a mapping (by a neural network or some regression analysis) between the log FBEs of signals from distant microphones and the log FBEs of clean signal. We therefore expect that the FBEs provide a reasonably effective and discriminative representation space of the speech signal towards differentiating the effects of noise injection and noise-suppression (i.e.…”
Section: Suppressed Speechmentioning
confidence: 99%