ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9414029
|View full text |Cite
|
Sign up to set email alerts
|

RNN Transducer Models for Spoken Language Understanding

Abstract: We present a comprehensive study on building and adapting RNN transducer (RNN-T) models for spoken language understanding (SLU). These end-to-end (E2E) models are constructed in three practical settings: a case where verbatim transcripts are available, a constrained case where the only available annotations are SLU labels and their values, and a more restrictive case where transcripts are available but not corresponding audio. We show how RNN-T SLU models can be developed starting from pre-trained automatic sp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 9 publications
(1 citation statement)
references
References 29 publications
(36 reference statements)
0
1
0
Order By: Relevance
“…The graph clearly shows that additional data helps to improve the accuracy of the classifier. Our second dataset was composed of recordings from a call center with manual transcription and labeling [12][13][14]. There are in total about 30,000 samples split into test (5,600), train (21,900), and development (2,500) sets.…”
Section: Dataset Amentioning
confidence: 99%
“…The graph clearly shows that additional data helps to improve the accuracy of the classifier. Our second dataset was composed of recordings from a call center with manual transcription and labeling [12][13][14]. There are in total about 30,000 samples split into test (5,600), train (21,900), and development (2,500) sets.…”
Section: Dataset Amentioning
confidence: 99%