2019
DOI: 10.25560/80134
|View full text |Cite
|
Sign up to set email alerts
|

Mask-based enhancement of very noisy speech

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 124 publications
0
1
0
Order By: Relevance
“…In this algorithm, spectral features extracted from the M output of the beamformer form the inputs to a DNN which has been trained to identify the TF regions that contain significant target speech energy. An overview of the mask estimation procedure is given below; full details of the neural network structure, input features and training regime are given in Moore et al (2018) and Lightburn (2020). The input features of the neural net were derived from a 90-channel cochleagram calculated using 25.6 ms frames with 50% overlap.…”
Section: Signal Processingmentioning
confidence: 99%
“…In this algorithm, spectral features extracted from the M output of the beamformer form the inputs to a DNN which has been trained to identify the TF regions that contain significant target speech energy. An overview of the mask estimation procedure is given below; full details of the neural network structure, input features and training regime are given in Moore et al (2018) and Lightburn (2020). The input features of the neural net were derived from a 90-channel cochleagram calculated using 25.6 ms frames with 50% overlap.…”
Section: Signal Processingmentioning
confidence: 99%