2020
DOI: 10.1109/lsp.2020.3039944
|View full text |Cite
|
Sign up to set email alerts
|

Flow-Based Independent Vector Analysis for Blind Source Separation

Abstract: This paper describes a time-varying extension of independent vector analysis (IVA) based on the normalizing flow (NF), called NF-IVA, for determined blind source separation of multichannel audio signals. As in IVA, NF-IVA estimates demixing matrices that transform mixture spectra to source spectra in the complex-valued spatial domain such that the likelihood of those matrices for the mixture spectra is maximized under some non-Gaussian source model. While IVA performs a time-invariant bijective linear transfor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

3
5

Authors

Journals

citations
Cited by 14 publications
(13 citation statements)
references
References 31 publications
0
13
0
Order By: Relevance
“…In the future, we will extend both the spatial and reverberation models to deal with time-varying acoustic environments (e.g., moving sources and microphones). One promising way is to use normalizing flows (NFs) for representing the diagonalizers and AR coefficient matrices in a time-varying manner as proposed for a determined BSS method called NF-IVA [57] with time-varying demixing matrices.…”
Section: Discussionmentioning
confidence: 99%
“…In the future, we will extend both the spatial and reverberation models to deal with time-varying acoustic environments (e.g., moving sources and microphones). One promising way is to use normalizing flows (NFs) for representing the diagonalizers and AR coefficient matrices in a time-varying manner as proposed for a determined BSS method called NF-IVA [57] with time-varying demixing matrices.…”
Section: Discussionmentioning
confidence: 99%
“…Considering the recent advance of deep learning techniques, one important future direction is to use a normalizing flow [52] for formulating an adaptive time-varying spatial model as proposed in [53]. Another complementary direction is to use a deep generative model of speech for improving the expression capability of the source model as proposed in [54], [55].…”
Section: Speech Separation With (Over)determined Configurationsmentioning
confidence: 99%
“…Besides supervised separation methods based on deep neural networks (DNNs) [3][4][5] that have been shown to work well, there is an increasing interest in DNN-based methods for semi-supervised separation and unsupervised separation, a.k.a. blind source separation (BSS), because of their potential in handling unseen sources in unknown environments [6][7][8][9][10].…”
Section: Introductionmentioning
confidence: 99%
“…Independent vector analysis (IVA) [12,13] is a classical BSS technique for determined separation case that decomposes M mixture STFT spectra (obtained from an M -channel recording) into spectra of N sources (N = M ) using time-invariant demixing matrices. By contrast, NF-IVA [8] uses timevarying demixing matrices represented by a normalizing flow (NF) [14]. It includes multilayer perceptrons (MLPs) that are optimized from scratch at run-time with backpropagation (BP) [15] given only the observed mixture.…”
Section: Introductionmentioning
confidence: 99%