2017 25th European Signal Processing Conference (EUSIPCO) 2017
DOI: 10.23919/eusipco.2017.8081277
|View full text |Cite
|
Sign up to set email alerts
|

Binaural speech enhancement with spatial cue preservation utilising simultaneous masking

Abstract: Abstract-Binaural multi-microphone noise reduction methods aim at noise suppression while preserving the spatial impression of the acoustic scene. Recently, a new binaural speech enhancement method was proposed which chooses per timefrequency (TF) tile either the enhanced target or a suppressed noisy version. The selection between the two is based on the input SNR per TF tile. In this paper we modify this method such that the selection mechanism is based on the output SNR. The proposed modification of deciding… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 14 publications
0
4
0
Order By: Relevance
“…Next, to measure the reduction of the interferers and diffuse noise components with the beamforming process, a signal to noise ratio gain (SNR-gain, array gain), a signal to interferers ratio gain (SIR-gain), and a signal to diffuse noise ratio gain (SDNR-gain) are computed on each side, providing the difference in dB between the SNR, SIR, and SDNR at the beamformer output and at the input reference microphone: (40) where in the above cross-and auto-PSDs x ref , v ref and n ref refer to the target, interferers and diffuse noise components at a reference microphone, while z x , z v and z n refer to the corresponding components in the beamformer output signal. Finally, to measure the target distortion on each side after processing, two measurements are used: a target Speech Distortion Ratio (SDR) and a Speech Distortion Magnitude-only distance (SDmag).…”
Section: Performance Measurementmentioning
confidence: 99%
See 1 more Smart Citation
“…Next, to measure the reduction of the interferers and diffuse noise components with the beamforming process, a signal to noise ratio gain (SNR-gain, array gain), a signal to interferers ratio gain (SIR-gain), and a signal to diffuse noise ratio gain (SDNR-gain) are computed on each side, providing the difference in dB between the SNR, SIR, and SDNR at the beamformer output and at the input reference microphone: (40) where in the above cross-and auto-PSDs x ref , v ref and n ref refer to the target, interferers and diffuse noise components at a reference microphone, while z x , z v and z n refer to the corresponding components in the beamformer output signal. Finally, to measure the target distortion on each side after processing, two measurements are used: a target Speech Distortion Ratio (SDR) and a Speech Distortion Magnitude-only distance (SDmag).…”
Section: Performance Measurementmentioning
confidence: 99%
“…A challenge for this classification algorithm is its applicability in low input SNR environments, as most T-F bins can be classified as noise-dominant, resulting in low SNR improvement and an attenuated target output, as illustrated in [39]. As an attempt to enhance the performance of this method, the classification mechanism was later modified to use the output SNR instead of the input SNR [40]. However, this method requires an estimation of the second order statistics of the noise and the target components, which, as previously described, can be challenging in some real-life time-varying multi-talker environments.…”
Section: Introductionmentioning
confidence: 99%
“…In [13], a Maximum Likelihood (ML) scheme was proposed for estimating the speech and noise PSDs. This method has been successfully used for scientific [15] as well as industrial [14,16] applications. However, typically there are some frequency bins where the ML estimation scheme suggests negative values for the speech spectrum.…”
Section: Introductionmentioning
confidence: 99%
“…More precisely, after applying the BMVDR beamformer both output components exhibit the binaural cues of the desired source component, i.e., both components are perceived as coming from the direction of the desired source such that the binaural hearing advantage cannot be exploited by the auditory system. Aiming at additionally preserving the binaural cues of the noise component and hence preserving the spatial impression of the complete acoustic scene, several extensions of the binaural MWF and the BMVDR beamformer have been proposed, e.g., by incorporating constraints into the spatial filter design [10]- [15] or by mixing with scaled (noisy) reference microphone signals [9], [16]- [20].…”
mentioning
confidence: 99%