2020
DOI: 10.48550/arxiv.2010.03449
Preprint

Super-Human Performance in Online Low-latency Recognition of Conversational Speech

Abstract: Achieving super-human performance in recognizing human speech has been a goal for several decades, as researchers have worked on increasingly challenging tasks. In the 1990s it was discovered that conversational speech between two humans is considerably more difficult to recognize than read speech, as hesitations, disfluencies, false starts and sloppy articulation complicate acoustic processing and require robust joint handling of acoustic, lexical and language context. Early attempts with statistical mode…

Citations: cited by 1 publication (3 citation statements)
References: 16 publications
“…Attention-based models based on sequence-to-sequence (S2S) [3,37,40,49] are currently one of the top-performing approaches to end-to-end ASR and MT. A significant amount of study has already been spent to improving the performance of S2S models.…”
Section: ASR and MT (mentioning)
confidence: 99%
“…LSTM-based [39] models include 6 bidirectional layers for the encoder and 2 unidirectional layers for the decoder, with 1536 units in each. They have delivered superior recognition performance on the Switchboard conversational speech benchmark task [40]. The Transformer-based model proposed in [48] feature 24 encoder layers and 8 decoder layers.…”
Section: ASR (mentioning)
confidence: 99%