ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019
DOI: 10.1109/icassp.2019.8683410
|View full text |Cite
|
Sign up to set email alerts
|

Multi-channel Itakura Saito Distance Minimization with Deep Neural Network

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
1

Relationship

5
2

Authors

Journals

citations
Cited by 19 publications
(20 citation statements)
references
References 21 publications
0
20
0
Order By: Relevance
“…The loss function for the DNN training is set to a divergence between two posterior PDFs, i.e., the posterior PDF estimated by [12] is utilized similarly to conventional supervised speech source separation [13,24], Π is a set of possible permutations, and…”
Section: Loss Function For Deep Neural Network Trainingmentioning
confidence: 99%
“…The loss function for the DNN training is set to a divergence between two posterior PDFs, i.e., the posterior PDF estimated by [12] is utilized similarly to conventional supervised speech source separation [13,24], Π is a set of possible permutations, and…”
Section: Loss Function For Deep Neural Network Trainingmentioning
confidence: 99%
“…In Section 3.1, we extend WA for applying it to our proposed system, which is defined in the time-domain. Section 3.2 describes another objective function calculated by a sum of the original multi-channel objective function [14] and a consistencyaware objective function defined in the T-F domain. Both proposed objective functions are summarized in Fig.…”
Section: Proposed Multi-channel Speech Enhancement Systemmentioning
confidence: 99%
“…1. After reviewing a multichannel loss function for timevarying MWF [28], the proposed time-invariant mask-based beamforming is introduced, which is based on the same loss function used in [28]. Since the loss function focuses on the time-varying MWF, it requires the estimated time-varying activation which is redundant for time-invariant beamforming.…”
Section: Proposed Mask-based Beamforming With Multichannel Loss Functionmentioning
confidence: 99%
“…For time-varying MWF, we proposed a multichannel loss function which evaluates the estimated time-varying spatial covariance matricesR t,f,n . In [28], a DNN estimates the timevarying activation and TF-mask. Based on DNN's outputs, the time-varying spatial covariance matrices are calculated aŝ R t,f,n =v t,f,nRf,n whereR f,n is given by Eq.…”
Section: Multichannel Loss Function For Time-varying Mwf [28]mentioning
confidence: 99%
See 1 more Smart Citation