ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9053357
|View full text |Cite
|
Sign up to set email alerts
|

Hydranet: A Real-Time Waveform Separation Network

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 9 publications
0
2
0
Order By: Relevance
“…Thus, an architecture similar to the Wave-U-Net [4] was used, replacing the convolutional bottleneck of the network with a recurrent module. The recurrent path consists of two bidirectional LSTM layers, as also proposed in [10], [18], of 168 units each, and a LeakyReLU activation after the second layer. Also, in order to keep the computational costs low, the number of filters in both the encoder and the decoder were halved compared to the original implementation [4].…”
Section: B Denoising Networkmentioning
confidence: 99%
“…Thus, an architecture similar to the Wave-U-Net [4] was used, replacing the convolutional bottleneck of the network with a recurrent module. The recurrent path consists of two bidirectional LSTM layers, as also proposed in [10], [18], of 168 units each, and a LeakyReLU activation after the second layer. Also, in order to keep the computational costs low, the number of filters in both the encoder and the decoder were halved compared to the original implementation [4].…”
Section: B Denoising Networkmentioning
confidence: 99%
“…Specifically, a temporal convolutional network for real time speech enhancement has been proposed in [27] while the latest state-of-the-art performance has been obtained by a real-time variation of Demucs for online speech denoising [6]. In a similar sense real-time music source separation models have been proposed in [9] and a system capable of real-time speech separation from background music has been implemented in [14].…”
Section: Introductionmentioning
confidence: 99%