ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9414038
|View full text |Cite
|
Sign up to set email alerts
|

Reverb Conversion Of Mixed Vocal Tracks Using An End-To-End Convolutional Deep Neural Network

Abstract: Reverb plays a critical role in music production, where it provides listeners with spatial realization, timbre, and texture of the music. Yet, it is challenging to reproduce the musical reverb of a reference music track even by skilled engineers. In response, we propose an end-to-end system capable of switching the musical reverb factor of two different mixed vocal tracks. This method enables us to apply the reverb of the reference track to the source track to which the effect is desired. Further, our model ca… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
5
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 19 publications
(19 reference statements)
0
5
0
Order By: Relevance
“…Comparison methods. We evaluate the proposed method against three baselines: Reverb Conversion (RC) (Koo et al, 2021), Music Enhancement (ME) (Kandpal et al, 2022), and Unsupervised Dereverberation (UD) (Saito et al, 2022). RC is a state-of-the-art, end-to- (Koo et al, 2021) 5.69 0.02 7.23 Music Enhancement (Kandpal et al, 2022) 7.51 −23.9 7.92 Unsupervised Dereverberation (Saito et al, 2022) 4 Evaluation metrics.…”
Section: Vocal Dereverberationmentioning
confidence: 99%
See 2 more Smart Citations
“…Comparison methods. We evaluate the proposed method against three baselines: Reverb Conversion (RC) (Koo et al, 2021), Music Enhancement (ME) (Kandpal et al, 2022), and Unsupervised Dereverberation (UD) (Saito et al, 2022). RC is a state-of-the-art, end-to- (Koo et al, 2021) 5.69 0.02 7.23 Music Enhancement (Kandpal et al, 2022) 7.51 −23.9 7.92 Unsupervised Dereverberation (Saito et al, 2022) 4 Evaluation metrics.…”
Section: Vocal Dereverberationmentioning
confidence: 99%
“…We evaluate the proposed method against three baselines: Reverb Conversion (RC) (Koo et al, 2021), Music Enhancement (ME) (Kandpal et al, 2022), and Unsupervised Dereverberation (UD) (Saito et al, 2022). RC is a state-of-the-art, end-to- (Koo et al, 2021) 5.69 0.02 7.23 Music Enhancement (Kandpal et al, 2022) 7.51 −23.9 7.92 Unsupervised Dereverberation (Saito et al, 2022) 4 Evaluation metrics. For quantitative comparison of the different methods, the metrics are the scale-invariant signalto-distortion ratio (SI-SDR) (Roux et al, 2019) improvement, the Fréchet Audio Distance (FAD) (Kilgour et al, 2018), and the speech-to-reverberation modulation energy ratio (SRMR) (Santos et al, 2014).…”
Section: Vocal Dereverberationmentioning
confidence: 99%
See 1 more Smart Citation
“…An end-to-end approach of converting black-box music effects with already processed audio tracks comes in need upon the loss of original dry source tracks or unavailability of replicating the setup of the mastering chain at the time, which may occur especially with old recordings. [8] introduced a system that interchanges the musical reverberant effects of two differently processed vocal tracks yet required massive storage of already-processed data to train. We propose an end-to-end remastering system that thoroughly converts the originally mastered effects to the desired style.…”
Section: Related Workmentioning
confidence: 99%
“…Blind estimation of the room impulse response from reverberant speech has also been explored [57,65]. In music production, acoustic matching is applied to change the reverberation to emulate that from a target space or processing algorithm [35,51]. Recent work conditions the target-audio generation on a low-dimensional audio embedding [59].…”
Section: Related Workmentioning
confidence: 99%