2014
DOI: 10.1109/taslp.2014.2320637
|View full text |Cite
|
Sign up to set email alerts
|

Joint Mixing Vector and Binaural Model Based Stereo Source Separation

Abstract: In this paper the mixing vector (MV) in the statistical mixing model is compared to the binaural cues represented by interaural level and phase differences (ILD and IPD). It is shown that the MV distributions are quite distinct while binaural models overlap when the sources are close to each other. On the other hand, the binaural cues are more robust to high reverberation than MV models. According to this complementary behavior we introduce a new robust algorithm for stereo speech separation which considers bo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

1
82
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 34 publications
(87 citation statements)
references
References 33 publications
1
82
0
Order By: Relevance
“…For binaural recordings, the state-of-the-art blind source separation (BSS) methods [11,12,13] using interaural level difference (ILD) and interaural phase difference (IPD) have demonstrated good performance for two-channel source separation. These BSS methods can largely preserve binaural cues, as well as maintain the energy of each sound source, which is vital for speech intelligibility in noise.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…For binaural recordings, the state-of-the-art blind source separation (BSS) methods [11,12,13] using interaural level difference (ILD) and interaural phase difference (IPD) have demonstrated good performance for two-channel source separation. These BSS methods can largely preserve binaural cues, as well as maintain the energy of each sound source, which is vital for speech intelligibility in noise.…”
Section: Introductionmentioning
confidence: 99%
“…1 illustrates the framework of the proposed system. A BSS algorithm [11,12,13] is applied to extract both the target and masker signals. To implement real-time source separation and intelligibility prediction, the separation model is trained offline.…”
Section: Introductionmentioning
confidence: 99%
“…Model-based blind source separation for exactly determined and underdetermined speech mixtures such as [1], [7], [8], [9], are more recent examples of applications in speech analysis involving frequency-specific GMMs. These methods have gained significant popularity due to their simple modelbased approach for integration of cues.…”
mentioning
confidence: 99%
“…More generally, this is due to the absence of an explicit model for reverberation. In addition to this, the frequency domain GMM in these algorithms, [1], [7], [8], relies on the assumption of the cues being independent. As noted in [8, sec.…”
mentioning
confidence: 99%
See 1 more Smart Citation