2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2016
DOI: 10.1109/icassp.2016.7471756
|View full text |Cite
|
Sign up to set email alerts
|

Jointly optimal near-end and far-end multi-microphone speech intelligibility enhancement based on mutual information

Abstract: The processing required for the global maximization of the intelligibility of speech acquired by multiple microphones and rendered by a single loudspeaker, is considered in this paper. The intelligibility is quantized, based on the mutual information rate between the message spoken by the talker and the message as interpreted by the listener. We prove that then, in each of a set of narrow-band channels, the processing can be decomposed into a minimum variance distortionless response (MVDR) beamforming operatio… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 23 publications
(44 reference statements)
0
2
0
Order By: Relevance
“…MI-based methods for joint far-and near-end SI enhancement [8,10] improve SI by maximizing the MI, I(S; Z), between the clean speech, S, and the signal received by the listener, Z.…”
Section: Existing Work Based On Mutual Informationmentioning
confidence: 99%
See 1 more Smart Citation
“…MI-based methods for joint far-and near-end SI enhancement [8,10] improve SI by maximizing the MI, I(S; Z), between the clean speech, S, and the signal received by the listener, Z.…”
Section: Existing Work Based On Mutual Informationmentioning
confidence: 99%
“…In [9] a new training strategy is proposed for deep learning based single-channel enhancement given that speech has already been processed at the far-end. For joint multi-microphone FSE and NLE [8,10] proposes to optimize the Mutual Information (MI) [11] between the clean speech and the signal received by the listener. The results of [8] are the first to show both theoretically and experimentally that joint processing, using knowledge of processing and conditions at both ends, is superior to the classic disjoint processing.…”
Section: Introductionmentioning
confidence: 99%