Jointly optimal near-end and far-end multi-microphone speech intelligibility enhancement based on mutual information

Khademi, Seyran; Hendriks, Richard C.; Kleijn, W. Bastiaan

doi:10.1109/icassp.2016.7471756

Cited by 5 publications

(2 citation statements)

References 23 publications

(44 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…MI-based methods for joint far-and near-end SI enhancement [8,10] improve SI by maximizing the MI, I(S; Z), between the clean speech, S, and the signal received by the listener, Z.…”

Section: Existing Work Based On Mutual Informationmentioning

confidence: 99%

“…In [9] a new training strategy is proposed for deep learning based single-channel enhancement given that speech has already been processed at the far-end. For joint multi-microphone FSE and NLE [8,10] proposes to optimize the Mutual Information (MI) [11] between the clean speech and the signal received by the listener. The results of [8] are the first to show both theoretically and experimentally that joint processing, using knowledge of processing and conditions at both ends, is superior to the classic disjoint processing.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Joint Far- and Near-End Speech Intelligibility Enhancement Based on the Approximated Speech Intelligibility Index

Fuglsig

Østergaard

Jensen

et al. 2022

ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

View full text Add to dashboard Cite

This paper considers speech enhancement of signals picked up in one noisy environment which must be presented to a listener in another noisy environment. Recently, it has been shown that an optimal solution to this problem requires the consideration of the noise sources in both environments jointly. However, the existing optimal mutual information based method requires a complicated system model that includes natural speech variations, and relies on approximations and assumptions of the underlying signal distributions. In this paper, we propose to use a simpler signal model and optimize speech intelligibility based on the Approximated Speech Intelligibility Index (ASII). We derive a closed-form solution to the joint far-and nearend speech enhancement problem that is independent of the marginal distribution of signal coefficients, and that achieves similar performance to existing work. In addition, we do not need to model or optimize for natural speech variations.

show abstract

“…MI-based methods for joint far-and near-end SI enhancement [8,10] improve SI by maximizing the MI, I(S; Z), between the clean speech, S, and the signal received by the listener, Z.…”

Section: Existing Work Based On Mutual Informationmentioning

confidence: 99%