2021
DOI: 10.1016/j.apacoust.2020.107566

Clustering of spatial cues by semantic segmentation for anechoic binaural source separation

Cited by 7 publications (27 citation statements)
References 19 publications
“…Removing the phase wrap problem by using the top down approach of [2] did not work well in case of U-Net, as it reduces the IPD variance of each source. This variance reduction works well with the expectation maximization (EM) algorithm [2] but not for the convolutional neural network U-Net [11]. The inclusion of IPD cues (whether the values observed from the mixture or those after the phase unwrap by the top down approach) in SONET [11], resulted in decline of its output performance.…”
Section: Related Work (mentioning)
confidence: 94%
“…This variance reduction works well with the expectation maximization (EM) algorithm [2] but not for the convolutional neural network U-Net [11]. The inclusion of IPD cues (whether the values observed from the mixture or those after the phase unwrap by the top down approach) in SONET [11], resulted in decline of its output performance. The performance comparison of the two speech separation models, one using the EM algorithm and the other using the SONET-P network for clustering the IPD cues, is given in the 'experiment' section (Section V).…”
Section: Related Work (mentioning)
confidence: 94%