DOI: 10.4995/thesis/10251/191001
|View full text |Cite
|
Sign up to set email alerts
|

Streaming Automatic Speech Recognition with Hybrid Architectures and Deep Neural Network Models

Abstract: Current state-of-the-art models based on Long Short-Term Memory (LSTM) networks have been extensively used in ASR to improve performance. However, using LSTMs under a streaming setup is not straightforward due to real-time constraints. In this paper we present a novel streaming decoder that includes a bidirectional LSTM acoustic model as well as an unidirectional LSTM language model to perform the decoding efficiently while keeping the performance comparable to that of an off-line setup. We perform a one-pass … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 41 publications
(59 reference statements)
0
1
0
Order By: Relevance
“…Researchers of the MLLP ASR team collaborated on the papers that use the output of ASR systems instead of the reference transcriptions. Specifically, recent advances in streaming ASR technology (Jorge Cano 2022) allowed this…”
Section: Mt Systemmentioning
confidence: 99%
“…Researchers of the MLLP ASR team collaborated on the papers that use the output of ASR systems instead of the reference transcriptions. Specifically, recent advances in streaming ASR technology (Jorge Cano 2022) allowed this…”
Section: Mt Systemmentioning
confidence: 99%