Interspeech 2020 2020
DOI: 10.21437/interspeech.2020-1753
|View full text |Cite
|
Sign up to set email alerts
|

ASR Error Correction with Augmented Transformer for Entity Retrieval

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
28
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 31 publications
(28 citation statements)
references
References 11 publications
0
28
0
Order By: Relevance
“…To quantify model robustness under noisy settings, we augmented ATIS and SNIP-Multi with environmental noise from MS-SNSD, which is a common scenario where users utter their spoken commands. The incentive here is how 'noise' was abused in some SLU literature, where ASR errors were treated as the noise source instead of modeling error, see [29,61]. Results on noisy test reveal that those work well on ATIS or SNIPS may break under realistic noises.…”
Section: Main Results On Clean and Noisy Slumentioning
confidence: 99%
“…To quantify model robustness under noisy settings, we augmented ATIS and SNIP-Multi with environmental noise from MS-SNSD, which is a common scenario where users utter their spoken commands. The incentive here is how 'noise' was abused in some SLU literature, where ASR errors were treated as the noise source instead of modeling error, see [29,61]. Results on noisy test reveal that those work well on ATIS or SNIPS may break under realistic noises.…”
Section: Main Results On Clean and Noisy Slumentioning
confidence: 99%
“…ASR error correction task is a well studied problem in literature and usually, it has been treated as a post processing task along with other tasks like punctuation prediction [15,16] and inverse text normalization [17]. The prior works have explored the problem using a variety of subtasks including grammar error correction, improving human readability [18], entity retrieval [19] etc. In [20], the authors use an RNN based external language model along with a stacked RNN based seq2seq spelling correction model for improving a baseline Listen Attend and Spell (LAS) based ASR system.…”
Section: Related Workmentioning
confidence: 99%
“…In [19], an Augmented Transformer model is proposed which leverages phonetic along with text for correcting ASR outputs. They show that jointly encoding both phoneme and text information helps in improving entity retrieval compared to a vanilla text transformer.…”
Section: Related Workmentioning
confidence: 99%
“…Grammatical Error Correction (GEC) aims to automatically detect and correct the grammatical errors that can be found in a sentence (Wang et al, 2020c). It is a crucial and essential application task in many natural language processing scenarios such as writing assistant (Ghufron and Rosyida, 2018;Napoles et al, 2017;Omelianchuk et al, 2020), search engine (Martins and Silva, 2004;Gao et al, 2010;Duan and Hsu, 2011), speech recognition systems (Karat et al, 1999;Wang et al, 2020a;Kubis et al, 2020), etc. Grammatical errors may appear in all languages (Dale et al, 2012;Xing et al, 2013;Ng et al, 2014;Rozovskaya et al, 2015;Bryant et al, 2019), in this paper, we only focus to tackle the problem of Chinese Grammatical Error Correction (CGEC) (Chang, 1995).…”
Section: Introductionmentioning
confidence: 99%