2018
DOI: 10.1109/taslp.2018.2800525
|View full text |Cite
|
Sign up to set email alerts
|

Phase-Aware Single-Channel Speech Enhancement With Modulation-Domain Kalman Filtering

Abstract: Abstract-We present a speech enhancement algorithm that performs modulation-domain Kalman filtering to track the speech phase using circular statistics, along with the log-spectra of speech and noise. In the proposed algorithm, the speech phase posterior is used to create an enhanced speech phase spectrum for the signal reconstruction of speech. The Kalman filter prediction step separately models the temporal inter-frame correlation of the speech and noise spectral log-amplitudes and of the speech phase, while… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
51
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 25 publications
(51 citation statements)
references
References 101 publications
(296 reference statements)
0
51
0
Order By: Relevance
“…The phase factor in STFT bins, ↵ k , is given by ↵ k = cos( (k) (k)), as in [9], [7] and [8]. For clarity, we omit the time-frame index, t, below and we only include it in equations involving multiple frames.…”
Section: A Signal Model and Bark Bandsmentioning
confidence: 99%
See 3 more Smart Citations
“…The phase factor in STFT bins, ↵ k , is given by ↵ k = cos( (k) (k)), as in [9], [7] and [8]. For clarity, we omit the time-frame index, t, below and we only include it in equations involving multiple frames.…”
Section: A Signal Model and Bark Bandsmentioning
confidence: 99%
“…As shown in Fig. 1, decorrelation is performed before the KF update using B 2 < (p+q)⇥(p+q) , as in [8] and [3]. Decorrelation and recorrelation are performed before and after the nonlinear KF update step, respectively.…”
Section: The Phase-sensitive Kf Update Stepmentioning
confidence: 99%
See 2 more Smart Citations
“…This approach was extended in [10] to use an improved signal model in which speech and noise were additive in the complex STFT domain rather than in the spectral amplitude domain. This was further developed in [52] to minimise the squared error in the logspectral domain and to track the speech phase in addition to the speech and noise amplitudes. Non-linear KFs that take into account that speech and noise add in the STFT domain are formulated in [53], [54] and [11].…”
Section: Introductionmentioning
confidence: 99%