2023
DOI: 10.3390/math11030622
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Representation Learning with Task-Agnostic Feature Masking for Robust End-to-End Speech Recognition

Abstract: Unsupervised learning-based approaches for training speech vector representations (SVR) have recently been widely applied. While pretrained SVR models excel in relatively clean automatic speech recognition (ASR) tasks, such as those recorded in laboratory environments, they are still insufficient for practical applications with various types of noise, intonation, and dialects. To cope with this problem, we present a novel unsupervised SVR learning method for practical end-to-end ASR models. Our approach involv… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 44 publications
0
3
0
Order By: Relevance
“…v) On-the-fly VR-SBE plus FBK+VR-SBE driven f-LHUC adaptation (Sys.12) not only outperforms the comparable baselines replacing VR-SBE with iVector or xVector (Sys. [10][11], but also gives further WER reductions over VR-SBE adaptation alone by 0.64% abs. (2.03% rel.)…”
Section: ) Performance Analysesmentioning
confidence: 96%
See 2 more Smart Citations
“…v) On-the-fly VR-SBE plus FBK+VR-SBE driven f-LHUC adaptation (Sys.12) not only outperforms the comparable baselines replacing VR-SBE with iVector or xVector (Sys. [10][11], but also gives further WER reductions over VR-SBE adaptation alone by 0.64% abs. (2.03% rel.)…”
Section: ) Performance Analysesmentioning
confidence: 96%
“…significant improvement (α = 0.05) obtained over iVector (Sys. 2,10,14,22,27), xVector (Sys. 3,11,15,23,28), or both.…”
Section: A Experiments On Dysarthric Speechmentioning
confidence: 99%
See 1 more Smart Citation