2023
DOI: 10.1109/taslp.2023.3252272
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Music Source Separation Using Differentiable Parametric Source Models

Abstract: Supervised deep learning approaches to underdetermined audio source separation achieve state-of-the-art performance but require a dataset of mixtures along with their corresponding isolated source signals. Such datasets can be extremely costly to obtain for musical mixtures. This raises a need for unsupervised methods. We propose a novel unsupervised modelbased deep learning approach to musical source separation. Each source is modelled with a differentiable parametric sourcefilter model. A neural network is t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
14
1

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 9 publications
(16 citation statements)
references
References 63 publications
(136 reference statements)
1
14
1
Order By: Relevance
“…The original model proposed in [1], referred to as UMSS, was shown to be efficient for complex source separation problems in which individual sources are not available for training, the sources are homogeneous (only singing voices), or only a limited amount of mixture recording data is obtainable. The approach is inspired by the recent hybrid deep learning paradigm, which integrates signal processing models in DNNs to incorporate domain knowledge [9,10].…”
Section: Unsupervised Music Source Separationmentioning
confidence: 99%
See 4 more Smart Citations
“…The original model proposed in [1], referred to as UMSS, was shown to be efficient for complex source separation problems in which individual sources are not available for training, the sources are homogeneous (only singing voices), or only a limited amount of mixture recording data is obtainable. The approach is inspired by the recent hybrid deep learning paradigm, which integrates signal processing models in DNNs to incorporate domain knowledge [9,10].…”
Section: Unsupervised Music Source Separationmentioning
confidence: 99%
“…The latter strategy obtains the best overall results and is then selected in this work. The model used in [1] to obtain source fundamental frequencies was given in [11] and performs multi-F0 extraction by first processing a spectral representation through a DNN, and then converting the output multi-frequency salience map to F0 contours. In [1], a voice assignment heuristic based on temporal pitch continuity was ©2024 IEEE.…”
Section: Unsupervised Music Source Separationmentioning
confidence: 99%
See 3 more Smart Citations