2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) 2021
DOI: 10.1109/asru51503.2021.9688282
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Task Language Modeling for Improving Speech Recognition of Rare Words

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(2 citation statements)
references
References 26 publications
0
2
0
Order By: Relevance
“…[56] conducted research on the possibility of directly attaching an audio encoder to LLMs to convert them into automatic speech recognition (ASR) systems, allowing their text counterparts to be used in the exact same manner. It has also been confirmed that when combining LLM with prompt engineering and fine-tuning, they can function as post-recognition processors for speech, conducting revision and error adjustment [57]. Topic 3 was labelled as "Research on tuning LLM for improving the efficiency" through "high," "time," "parameter," "structure," "simulation," and "flow."…”
Section: Results Of the Topic Analysis For Web Of Sciencementioning
confidence: 96%
“…[56] conducted research on the possibility of directly attaching an audio encoder to LLMs to convert them into automatic speech recognition (ASR) systems, allowing their text counterparts to be used in the exact same manner. It has also been confirmed that when combining LLM with prompt engineering and fine-tuning, they can function as post-recognition processors for speech, conducting revision and error adjustment [57]. Topic 3 was labelled as "Research on tuning LLM for improving the efficiency" through "high," "time," "parameter," "structure," "simulation," and "flow."…”
Section: Results Of the Topic Analysis For Web Of Sciencementioning
confidence: 96%
“…End-to-end automatic speech recognition [1,2] (ASR) has many applications in human society, empowering voice-based intelligent control, spoken language understanding [3], on-device services [4], and web-based speech interactions [5]. These high-performance speech applications benefit from neural network-based ASR systems that have highly accurate on-device performance with fixed model parameters.…”
Section: Introductionmentioning
confidence: 99%