2022
DOI: 10.1109/taslp.2022.3202121
|View full text |Cite
|
Sign up to set email alerts
|

Dual Microphone Speech Enhancement Based on Statistical Modeling of Interchannel Phase Difference

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
1

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 48 publications
0
4
0
Order By: Relevance
“…The proposed dual channel noise PSD estimator based on coherence in (20) shows different characteristics from the single-channel SPP-based noise PSD estimator in (7). Figure 3 shows one example of the noise power spectrum in the beamformer output and the estimates of it for Cafeteria noise at 5 dB SNR.…”
Section: Combining Noise Psd Estimates and Gain Calculationmentioning
confidence: 99%
See 1 more Smart Citation
“…The proposed dual channel noise PSD estimator based on coherence in (20) shows different characteristics from the single-channel SPP-based noise PSD estimator in (7). Figure 3 shows one example of the noise power spectrum in the beamformer output and the estimates of it for Cafeteria noise at 5 dB SNR.…”
Section: Combining Noise Psd Estimates and Gain Calculationmentioning
confidence: 99%
“…Over the past decades, there has been a growing demand for speech enhancement using microphone arrays in speech processing applications such as automatic speech recognition, mobile communications, and hearing aids [ 1 , 2 , 3 , 4 ]. Multichannel speech enhancement aims to reduce the additive noise and improve the quality of the speech signals obtained by multiple microphones placed in a variety of acoustic environments [ 5 , 6 , 7 , 8 , 9 , 10 , 11 , 12 , 13 , 14 , 15 , 16 , 17 , 18 , 19 , 20 , 21 , 22 , 23 , 24 , 25 , 26 , 27 , 28 , 29 , 30 , 31 , 32 ]. In many multichannel speech enhancement systems, beamforming algorithms, such as the minimum-variance distortionless-response (MVDR) beamformer [ 11 ] and the general transfer function generalized sidelobe canceler (TF-GSC) [ 12 , 13 ], have been employed to extract a desired signal, exploiting spatial information on the location of the sound sources.…”
Section: Introductionmentioning
confidence: 99%
“…Additionally, we carried out an ablation study to analyze how much each module in the proposed system contributed to the performance improvement. We propose the speech PSD estimator, φ tcs,s s in (31), and the RTF estimator, g tdoa,s in (29). The previous approaches were the speech PSD estimator using recursive smoothing, φ ts s in (23), and the ML estimator of the RTF g ml in (25).…”
Section: Ablation Studymentioning
confidence: 99%
“…Speech enhancement is essential to ensure the satisfactory perceptual quality and intelligibility of speech signals in many speech applications, such as hearing aids and speech communication with mobile phones and hands-free systems [ 1 , 2 , 3 , 4 , 5 , 6 , 7 , 8 , 9 , 10 , 11 , 12 , 13 , 14 , 15 , 16 , 17 , 18 , 19 , 20 , 21 , 22 , 23 , 24 , 25 , 26 , 27 , 28 , 29 , 30 , 31 , 32 , 33 , 34 , 35 , 36 , 37 , 38 , 39 , 40 , 41 , 42 , 43 ]. Currently, devices with multiple microphones are popular, which has enabled multi-microphone speech enhancement, exploiting spatial information as well as spectro-temporal characteristics of the input signals [ 6 , 7 , 8 , 9 , 10 , 11 , 12 , 13 , 14 ,…”
Section: Introductionmentioning
confidence: 99%