2020
DOI: 10.48550/arxiv.2011.02090
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Frustratingly Easy Noise-aware Training of Acoustic Models

Desh Raj,
Jesus Villalba,
Daniel Povey
et al.

Abstract: Environmental noises and reverberation have a detrimental effect on the performance of automatic speech recognition (ASR) systems. Multi-condition training of neural networkbased acoustic models is used to deal with this problem, but it requires many-folds data augmentation, resulting in increased training time. In this paper, we propose utterance-level noise vectors for noise-aware training of acoustic models in hybrid ASR. Our noise vectors are obtained by combining the means of speech frames and silence fra… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 20 publications
(29 reference statements)
0
2
0
Order By: Relevance
“…In [20], the invariant representation learning technique was proposed, which demonstrated significant reduction in character error rate and robustness for out-ofdomain noise settings. In [21], a simple method was considered to extract a noise vector for acoustic model training. It is suggested that the technique could also be applied in online ASR by estimating the mean vector with frame-level maximum likelihood.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In [20], the invariant representation learning technique was proposed, which demonstrated significant reduction in character error rate and robustness for out-ofdomain noise settings. In [21], a simple method was considered to extract a noise vector for acoustic model training. It is suggested that the technique could also be applied in online ASR by estimating the mean vector with frame-level maximum likelihood.…”
Section: Related Workmentioning
confidence: 99%
“…Factor aware training has been shown to be effective in ASR system development [5,19,21]. This training strategy produces a system that is more robust to factors such as noise, speaker, and room characteristics.…”
Section: Scenario Awarementioning
confidence: 99%