2021
DOI: 10.1109/lsp.2021.3101699
|View full text |Cite
|
Sign up to set email alerts
|

Neural Full-Rank Spatial Covariance Analysis for Blind Source Separation

Abstract: This paper describes a neural blind source separation (BSS) method based on amortized variational inference (AVI) of a non-linear generative model of mixture signals. A classical statistical approach to BSS is to fit a linear generative model that consists of spatial and source models representing the interchannel covariances and power spectral densities of sources, respectively. Although the variational autoencoder (VAE) has successfully been used as a non-linear source model with latent features, it should b… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 15 publications
(9 citation statements)
references
References 32 publications
0
9
0
Order By: Relevance
“…The deep spectral model can be trained in an unsupervised manner by using only multichannel mixtures based on amortized variational inference [21], [33]. This method called neural FCA utilizes an inference (encoder) network to predict the speech features Z ≜ {z nt } N,T n,t=1 from an input mixture X ≜ {x f t } F,T f,t=1 as the posterior distribution q ϕ (Z | X), where ϕ represents the network parameters.…”
Section: Neural Full-rank Spatial Covariance Analysismentioning
confidence: 99%
See 4 more Smart Citations
“…The deep spectral model can be trained in an unsupervised manner by using only multichannel mixtures based on amortized variational inference [21], [33]. This method called neural FCA utilizes an inference (encoder) network to predict the speech features Z ≜ {z nt } N,T n,t=1 from an input mixture X ≜ {x f t } F,T f,t=1 as the posterior distribution q ϕ (Z | X), where ϕ represents the network parameters.…”
Section: Neural Full-rank Spatial Covariance Analysismentioning
confidence: 99%
“…1). As the estimates of z nt , following the original neural FCA [21], the inference model predicts the posterior distribution q ϕ (Z | X) as follows:…”
Section: B Inference Modelmentioning
confidence: 99%
See 3 more Smart Citations