ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9414965
|View full text |Cite
|
Sign up to set email alerts
|

A Modulation-Domain Loss for Neural-Network-Based Real-Time Speech Enhancement

Abstract: We describe a modulation-domain loss function for deeplearning-based speech enhancement systems. Learnable spectro-temporal receptive fields (STRFs) were adapted to optimize for a speaker identification task. The learned STRFs were then used to calculate a weighted mean-squared error (MSE) in the modulation domain for training a speech enhancement system. Experiments showed that adding the modulation-domain MSE to the MSE in the spectro-temporal domain substantially improved the objective prediction of speech … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(1 citation statement)
references
References 32 publications
(44 reference statements)
0
1
0
Order By: Relevance
“…The final output of the first step is Y^(LPS). And, the final output of the second step is < YR. Waveform reconstruction is a type of step in the enhancement stage [20]. The output from the RNN-LSTM training XLPS is fed into the Exp (), which corresponds to the process of exponential the input.…”
Section: ░ 3 Block Diagrammentioning
confidence: 99%
“…The final output of the first step is Y^(LPS). And, the final output of the second step is < YR. Waveform reconstruction is a type of step in the enhancement stage [20]. The output from the RNN-LSTM training XLPS is fed into the Exp (), which corresponds to the process of exponential the input.…”
Section: ░ 3 Block Diagrammentioning
confidence: 99%