2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2016
DOI: 10.1109/icassp.2016.7471634
|View full text |Cite
|
Sign up to set email alerts
|

Complex NMF under phase constraints based on signal modeling: Application to audio source separation

Abstract: Nonnegative Matrix Factorization (NMF) is a powerful tool for decomposing mixtures of audio signals in the Time-Frequency (TF) domain. In the source separation framework, the phase recovery for each extracted component is necessary for synthesizing time-domain signals. The Complex NMF (CNMF) model aims to jointly estimate the spectrogram and the phase of the sources, but requires to constrain the phase in order to produce satisfactory sounding results. We propose to incorporate phase constraints based on signa… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
19
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
2

Relationship

4
3

Authors

Journals

citations
Cited by 14 publications
(21 citation statements)
references
References 19 publications
2
19
0
Order By: Relevance
“…This model is applied to a musical source separation task in a semiinformed setting. It outperforms both the traditional phaseunaware ISNMF and the phase-constrained CNMF model [25]. This demonstrates the usefulness of such a phase-aware Bayesian AG model to perform the joint estimation of magnitudes and phases for audio source separation.…”
Section: Introductionmentioning
confidence: 76%
See 2 more Smart Citations
“…This model is applied to a musical source separation task in a semiinformed setting. It outperforms both the traditional phaseunaware ISNMF and the phase-constrained CNMF model [25]. This demonstrates the usefulness of such a phase-aware Bayesian AG model to perform the joint estimation of magnitudes and phases for audio source separation.…”
Section: Introductionmentioning
confidence: 76%
“…We remark that if κ = 0, then λ = ρ = 0: therefore, q j,f t = 0 and p j,f t becomes the posterior power of s j,f t , as mentioned in Section III-C1. Then, we recognize in (25) the IS divergence between P j and W j H j , as in the EM algorithm for ISNMF [46]. Consequently, the updates rules (36) and (37) are similar to those obtained in such a scenario [46], up to an additional power 1/2, which is common when applying the majorizeminimization methodology for estimating ISNMF [44].…”
Section: ) Relation To Other Approachesmentioning
confidence: 88%
See 1 more Smart Citation
“…The goal is to estimate the quadratic terms in (8) and (9) from the reverberant signal y(t) defined in (3), in order to resynthesize the anechoic signal s(t). To do so, we use the fact that if the RIR h(t) is modeled as in Section II-C, then for any analog signals x 1 (t) and x 2 (t):…”
Section: Estimation From a Reverberant Signalmentioning
confidence: 99%
“…This is the main drawback of these methods, because using this corrupted phase reintroduces reverberation and distortion in the signal, as shown in [8]. In the source separation literature, the idea of modeling the phase has recently been proposed [9] because a similar problem occurs (the phase of the mixture is generally used to synthesize the source signals), but not for dereverberation.…”
Section: Introductionmentioning
confidence: 99%