2017
DOI: 10.1109/taslp.2016.2641904
|View full text |Cite
|
Sign up to set email alerts
|

Single-Channel Online Enhancement of Speech Corrupted by Reverberation and Noise

Abstract: This paper proposes an online single-channel speech enhancement method designed to improve the quality of speech degraded by reverberation and noise. Based on an autoregressive model for the reverberation power and on a hidden Markov model for clean speech production, a Bayesian filtering formulation of the problem is derived and online joint estimation of the acoustic parameters and mean speech, reverberation, and noise powers is obtained in mel-frequency bands. From these estimates, a realvalued spectral gai… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
53
0
4

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
3

Relationship

3
6

Authors

Journals

citations
Cited by 34 publications
(57 citation statements)
references
References 48 publications
0
53
0
4
Order By: Relevance
“…The SPENDRED algorithm, which is presented in [34] [35], performs time-varying T 60 and DRR estimation and it internally (and not externally) estimates T 60 and DRR at every time step. However, unless the source or the microphone are moving around, the T 60 and DRR will presumably be constant throughout the recording.…”
Section: Additional Literature Reviewmentioning
confidence: 99%
See 1 more Smart Citation
“…The SPENDRED algorithm, which is presented in [34] [35], performs time-varying T 60 and DRR estimation and it internally (and not externally) estimates T 60 and DRR at every time step. However, unless the source or the microphone are moving around, the T 60 and DRR will presumably be constant throughout the recording.…”
Section: Additional Literature Reviewmentioning
confidence: 99%
“…Estimating frequency-dependent reverberation parameters is beneficial. Reverberation is frequency dependent and obtaining a T 60 estimate for each individual frequency bin, or for every Melspaced frequency band as in [34] [35], is advantageous.…”
Section: Additional Literature Reviewmentioning
confidence: 99%
“…2.1, we do KF noise tracking in the log-power spectral domain based on AR(r) modeling and on the estimated SNR in the modulation frame [3]. After the noise KF prediction step, we decorrelate the joint KF state and, then, we multiply the noise log-power Gaussian with the Gaussian that is obtained from external noise estimation and log-normal noise power modeling [20] [21].…”
Section: Kf Noise Tracking and The Joint Speech-noise Kf Statementioning
confidence: 99%
“…After the noise KF prediction, we decorrelate the noise KF state and, then, we multiply the noise log-power Gaussian with the Gaussian that is obtained from external noise estimation and log-normal noise power modeling [25] [26]. As in (1) that describes the speech KF prediction, for the noise, (n), KF prediction:…”
Section: Noise Tracking and The Speech-noise Kfmentioning
confidence: 99%