2013 IEEE International Conference on Acoustics, Speech and Signal Processing 2013
DOI: 10.1109/icassp.2013.6637773
|View full text |Cite
|
Sign up to set email alerts
|

Prediction based filtering and smoothing to exploit temporal dependencies in NMF

Abstract: Nonnegative matrix factorization is an appealing technique for many audio applications. However, in it's basic form it does not use temporal structure, which is an important source of information in speech processing. In this paper, we propose NMF-based filtering and smoothing algorithms that are related to Kalman filtering and smoothing. While our prediction step is similar to that of Kalman filtering, we develop a multiplicative update step which is more convenient for nonnegative data analysis and in line w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
26
0

Year Published

2013
2013
2018
2018

Publication Types

Select...
5
3
1

Relationship

2
7

Authors

Journals

citations
Cited by 15 publications
(27 citation statements)
references
References 15 publications
1
26
0
Order By: Relevance
“…The addition of temporal modeling in other NMF-based methods [23,24] has been shown to improve noticeably source separation quality. Other potential improvements include the adaptive selection of some of the parameters, such as the number of noise spectral features KN or the sparsity parameter λ.…”
Section: Resultsmentioning
confidence: 99%
“…The addition of temporal modeling in other NMF-based methods [23,24] has been shown to improve noticeably source separation quality. Other potential improvements include the adaptive selection of some of the parameters, such as the number of noise spectral features KN or the sparsity parameter λ.…”
Section: Resultsmentioning
confidence: 99%
“…The algorithms in [38] track the time evolution of the clean STFT amplitude domain coefficients in every frequency bin. In [64], speech inter-frame correlation is modeled. Considering KF algorithms, many papers, such as [50] [51] and [53], use the non-linear observation model relating clean and noisy speech in the log-spectral domain.…”
Section: Literature Reviewmentioning
confidence: 99%
“…The modulation domain models the inter-frame correlation of clean speech and does not consider each time-frame independently. In [64], inter-frame speech correlation is modeled and is then followed by NMF. Inter-frame correlations of speech are considered in several papers and books by J. Benesty, i.e.…”
Section: Literature Reviewmentioning
confidence: 99%
“…A linear nonnegative dynamical system is presented in [38] to model temporal dependencies in NMF. The proposed causal filtering and fixed-lag smoothing algorithms use Kalmanlike prediction in NMF and PLCA.…”
Section: Review Of State-of-the-art Nmf-based Speech Enhancementmentioning
confidence: 99%