2014
DOI: 10.1109/msp.2013.2297440
|View full text |Cite
|
Sign up to set email alerts
|

From Blind to Guided Audio Source Separation: How models and side information can improve the separation of sound

Abstract: Audio is a domain where signal separation has long been considered as a fascinating objective, potentially offering a wide range of new possibilities and experiences in professional and personal contexts, by better taking advantage of audio material and finely analyzing complex acoustic scenes. It has thus always been a major area for research in signal separation and an exciting challenge for industrial applications. Starting with blind separation of toy mixtures in the mid 90's, research has progressed up to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
108
0
5

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 133 publications
(113 citation statements)
references
References 30 publications
0
108
0
5
Order By: Relevance
“…We refer the reader to our web page for listening to audio examples illustrating the results discussed below 1 . Matlab code implementing the proposed method is also available.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…We refer the reader to our web page for listening to audio examples illustrating the results discussed below 1 . Matlab code implementing the proposed method is also available.…”
Section: Methodsmentioning
confidence: 99%
“…Indeed, model-based approaches can take advantage of the very particular structure of audio signals in the TF plane [1]. For example, sparse component analysis methods [2] exploit the sparsity of the source signals in the TF domain.…”
Section: Introductionmentioning
confidence: 99%
“…In some recent research it is shown that reverberation can be exploited to obtain information about the recording environment such as room geometry and source locations [8,9,10,11]. Inspired by those works and given that clustered and informed approaches are shown to yield better results in ad-hoc microphone array contexts [1,2,6,12], this research is trying to find a setup independent approach to choose a subset of microphones that yields higher quality outputs in terms of echo cancellation. In order to achieve this goal the first step is to extract discriminative features from speech signals to choose a subset of single microphones (Section III.A).…”
Section: Apply the Unsupervised Clustering Algorithm On Discriminativmentioning
confidence: 99%
“…It is a topic that has many applications in the entertainment industry such as automatic karaoke [19], [29], music upmixing [21], [22], [23] or audio restoration [31]. For this reason, it has gathered the attention of a large community of researchers in the past 15 years [35], [34]. The inherent difficulty of audio source separation comes from the fact that it is essentially an ill-posed inverse problem: in a typical setting, we have more signals to estimate than the number of signals we observe.…”
Section: Introductionmentioning
confidence: 99%
“…This track of research increasingly showed that prior assumptions about the sources leads to improved performance in practice [33], [16], [34]. The common ground of most related work on audio source separation then becomes the building of models for the spectrograms of the sources that have a strong expressive power while requiring the fitting of only a small number of parameters.…”
Section: Introductionmentioning
confidence: 99%