Phase-Aware Single-Channel Speech Enhancement With Modulation-Domain Kalman Filtering

Dionelis, Nikolaos; Brookes, Mike

doi:10.1109/taslp.2018.2800525

Cited by 25 publications

(51 citation statements)

References 101 publications

(296 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The phase factor in STFT bins, ↵ k , is given by ↵ k = cos( (k) (k)), as in [9], [7] and [8]. For clarity, we omit the time-frame index, t, below and we only include it in equations involving multiple frames.…”

Section: A Signal Model and Bark Bandsmentioning

confidence: 99%

“…As shown in Fig. 1, decorrelation is performed before the KF update using B 2 < (p+q)⇥(p+q) , as in [8] and [3]. Decorrelation and recorrelation are performed before and after the nonlinear KF update step, respectively.…”

Section: The Phase-sensitive Kf Update Stepmentioning

confidence: 99%

“…In (8), the outer integration over the phase factor in Bark bands, l , is performed using G sigma points. E{ z l }, as computed in Sec.…”

Section: The Phase-sensitive Kf Update Stepmentioning

confidence: 99%

“…We approximate the posterior of the speech and noise spectral log-powers as a two-dimensional Gaussian distribution with a full covariance matrix using the probability distribution of the phase factor in Bark bands. The phasesensitive KF update step computes the first two moments of the posterior distribution, [8], [7], [9], suppressing noise.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Speech Enhancement Using Kalman Filtering in the Logarithmic Bark Power Spectral Domain

Dionelis

Brookes

2018

2018 26th European Signal Processing Conference (EUSIPCO)

Self Cite

View full text Add to dashboard Cite

We present a phase-sensitive speech enhancement algorithm based on a Kalman filter estimator that tracks speech and noise in the logarithmic Bark power spectral domain. With modulation-domain Kalman filtering, the algorithm tracks the speech spectral log-power using perceptually-motivated Bark bands. By combining STFT bins into Bark bands, the number of frequency components is reduced. The Kalman filter prediction step separately models the inter-frame relations of the speech and noise spectral log-powers and the Kalman filter update step models the nonlinear relations between the speech and noise spectral log-powers using the phase factor in Bark bands, which follows a sub-Gaussian distribution. The posterior mean of the speech spectral log-power is used to create an enhanced speech spectrum for signal reconstruction. The algorithm is evaluated in terms of speech quality and computational complexity with different algorithm configurations compared on various noise types. The algorithm implemented in Bark bands is compared to algorithms implemented in STFT bins and experimental results show that tracking speech in the log Bark power spectral domain, taking into account the temporal dynamics of each subband envelope, is beneficial. Regarding the computational complexity, the percentage decrease in the real-time factor is 44% when using Bark bands compared to when using STFT bins.

show abstract

Section: A Signal Model and Bark Bandsmentioning

confidence: 99%

Section: The Phase-sensitive Kf Update Stepmentioning

confidence: 99%

“…In (8), the outer integration over the phase factor in Bark bands, l , is performed using G sigma points. E{ z l }, as computed in Sec.…”

Section: The Phase-sensitive Kf Update Stepmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Speech Enhancement Using Kalman Filtering in the Logarithmic Bark Power Spectral Domain

Dionelis

Brookes

2018

2018 26th European Signal Processing Conference (EUSIPCO)

Self Cite

View full text Add to dashboard Cite

show abstract

“…This approach was extended in [10] to use an improved signal model in which speech and noise were additive in the complex STFT domain rather than in the spectral amplitude domain. This was further developed in [52] to minimise the squared error in the logspectral domain and to track the speech phase in addition to the speech and noise amplitudes. Non-linear KFs that take into account that speech and noise add in the STFT domain are formulated in [53], [54] and [11].…”

Section: Introductionmentioning

confidence: 99%

Modulation-Domain Kalman Filtering for Monaural Blind Speech Denoising and Dereverberation

Dionelis

Brookes

2019

IEEE/ACM Trans. Audio Speech Lang. Process.

Self Cite

View full text Add to dashboard Cite

We describe a monaural speech enhancement algorithm based on modulation-domain Kalman filtering to blindly track the time-frequency log-magnitude spectra of speech and reverberation. We propose an adaptive algorithm that performs blind joint denoising and dereverberation, while accounting for the inter-frame speech dynamics, by estimating the posterior distribution of the speech log-magnitude spectrum given the log-magnitude spectrum of the noisy reverberant speech. The Kalman filter update step models the non-linear relations between the speech, noise and reverberation log-spectra. The Kalman filtering algorithm uses a signal model that takes into account the reverberation parameters of the reverberation time, T60, and the direct-to-reverberant energy ratio (DRR) and also estimates and tracks the T60 and the DRR in every frequency bin to improve the estimation of the speech log-spectrum. The proposed algorithm is evaluated in terms of speech quality, speech intelligibility and dereverberation performance for a range of reverberation parameters and reverberant speech to noise ratios, in different noises, and is also compared to competing denoising and dereverberation techniques. Experimental results using noisy reverberant speech demonstrate the effectiveness of the enhancement algorithm.

show abstract

Method of Real-Time Speaker Identifying by Voice

Shumskaya

2021

Lecture Notes in Electrical Engineering

View full text Add to dashboard Cite

Phase-Aware Single-Channel Speech Enhancement With Modulation-Domain Kalman Filtering

Cited by 25 publications

References 101 publications

Speech Enhancement Using Kalman Filtering in the Logarithmic Bark Power Spectral Domain

Speech Enhancement Using Kalman Filtering in the Logarithmic Bark Power Spectral Domain

Modulation-Domain Kalman Filtering for Monaural Blind Speech Denoising and Dereverberation

Method of Real-Time Speaker Identifying by Voice

Contact Info

Product

Resources

About