2021
DOI: 10.48550/arxiv.2103.02644
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Compute and memory efficient universal sound source separation

Efthymios Tzinis,
Zhepei Wang,
Xilin Jiang
et al.

Abstract: Recent progress in audio source separation lead by deep learning has enabled many neural network models to provide robust solutions to this fundamental estimation problem. In this study, we provide a family of efficient neural network architectures for general purpose audio source separation while focusing on multiple computational aspects that hinder the application of neural networks in real-world scenarios. The backbone structure of this convolutional network is the SUccessive DOwnsampling and Resampling of… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 33 publications
0
2
0
Order By: Relevance
“…In future works, we will consider training the system on a larger dataset, which may include audio clips with multiple AE classes, or unlabeled data [8,9]. Moreover, we will also investigate other EL [13] to provide more discriminative embedding vectors and extend the model to perform online processing [30].…”
Section: Discussionmentioning
confidence: 99%
“…In future works, we will consider training the system on a larger dataset, which may include audio clips with multiple AE classes, or unlabeled data [8,9]. Moreover, we will also investigate other EL [13] to provide more discriminative embedding vectors and extend the model to perform online processing [30].…”
Section: Discussionmentioning
confidence: 99%
“…We force the estimated sources to add up to the input mixture using a mixture consistency layer [28] at the output of our separation model. For all the other parameters we choose the default settings provided in [29] for a sampling rate of 16k Hz.…”
Section: Separation Modelmentioning
confidence: 99%