2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2017
DOI: 10.1109/waspaa.2017.8169995
|View full text |Cite
|
Sign up to set email alerts
|

Guiding audio source separation by video object information

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 16 publications
(18 citation statements)
references
References 15 publications
0
18
0
Order By: Relevance
“…This type of guidance has been shown to have a good impact on the source separation performance. Thus, in [12], the authors use visual guidance for improving source separation quality. Additionally, in a concurrent work [13], the authors explore a similar idea of class-conditioning over the joint embedded space, but unlike us, they use an auxiliary network to model parameters of a GMM for the final source separation, and they take spectrograms as an input of the model.…”
Section: Related Workmentioning
confidence: 99%
“…This type of guidance has been shown to have a good impact on the source separation performance. Thus, in [12], the authors use visual guidance for improving source separation quality. Additionally, in a concurrent work [13], the authors explore a similar idea of class-conditioning over the joint embedded space, but unlike us, they use an auxiliary network to model parameters of a GMM for the final source separation, and they take spectrograms as an input of the model.…”
Section: Related Workmentioning
confidence: 99%
“…The motions of players are often highly correlated with the characteristics of the sound sources [6]. There has been work on modeling such correlations for audio source separation [22]. Besides instrumental players, conductor gesture analysis has also been investigated in audiovisual music performance analysis.…”
Section: Table 1 a Categorization Of Existing Research On Audiovisuamentioning
confidence: 99%
“…Audio source separation in music recordings is a particularly interesting task, where audiovisual matching between the visual events of a performer's actions and their audio rendering can be of great value. Notably, such an approach enables addressing audio separation tasks that could not be performed in a unimodal fashion (solely analyzing the audio signal), as when considering two or more instances of the same instruments, say, a duet of guitars or violins, as done in the work of Parekh et al [22]. Knowing whether a musician is playing or not at a particular point in time gives important cues for source allocation.…”
Section: Music Source Separation Using Dynamic Correspondencementioning
confidence: 99%
See 2 more Smart Citations