2018
DOI: 10.48550/arxiv.1803.09816
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Spectral feature mapping with mimic loss for robust speech recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2019
2019

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 0 publications
0
2
0
Order By: Relevance
“…( 1). Compared to IRM (L1), IRM (CGAN) has an additional adversarial loss term with λ = 0.01 as in (Bagchi et al, 2018;Pascual et al, 2017). A parameter exploring policy gradients (Sehnke et al, 2010) based black-box optimization, which is similar to the one used in (Zhang et al, 2018), is also compared.…”
Section: Objective Evaluation With Different Loss Functionsmentioning
confidence: 99%
See 1 more Smart Citation
“…( 1). Compared to IRM (L1), IRM (CGAN) has an additional adversarial loss term with λ = 0.01 as in (Bagchi et al, 2018;Pascual et al, 2017). A parameter exploring policy gradients (Sehnke et al, 2010) based black-box optimization, which is similar to the one used in (Zhang et al, 2018), is also compared.…”
Section: Objective Evaluation With Different Loss Functionsmentioning
confidence: 99%
“…The deep models were then optimized by minimizing the distance between generated speech and clean speech. However, the distance (objective function) is usually based on simple L p loss (where p = 1 or 2), which does not reflect human auditory perception or ASR accuracy (Bagchi et al, 2018) well. In fact, several researches have indicated that an enhanced speech with a smaller L p distance, does not guarantee a higher quality or intelligibility score (Fu et al, 2018b;Koizumi et al, 2018).…”
Section: Introductionmentioning
confidence: 99%