2023
DOI: 10.1186/s13636-023-00313-7

Improving speech recognition systems for the morphologically complex Malayalam language using subword tokens for language modeling

Kavya Manohar,
Jayan A R,
Rajeev Rajan

Abstract: This article presents research on improving speech recognition for the morphologically complex Malayalam language using subword tokens for language modeling. The automatic speech recognition (ASR) system is built on a deep neural network–hidden Markov model (DNN-HMM) architecture. We propose a novel method, syllable-byte pair encoding (S-BPE), that combines linguistically informed syllable tokenization with the data-driven tokenization method of byte pair encoding (BPE). The pro…
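To make the idea of combining syllable tokenization with BPE concrete, the following is a minimal sketch, not the authors' implementation (the abstract is truncated and does not give the algorithm's details). It assumes words have already been split into syllables by some upstream syllabifier, and it runs the standard BPE merge procedure over those syllables instead of over characters. The function names `merge_pair`, `learn_merges`, and `tokenize` are hypothetical, and the romanized syllables in the usage note are placeholders rather than real Malayalam data.

```python
from collections import Counter


def merge_pair(units, pair):
    """Replace every adjacent occurrence of `pair` in `units` with the joined unit."""
    out, i = [], 0
    while i < len(units):
        if i + 1 < len(units) and (units[i], units[i + 1]) == pair:
            out.append(units[i] + units[i + 1])
            i += 2
        else:
            out.append(units[i])
            i += 1
    return tuple(out)


def learn_merges(corpus, num_merges):
    """Learn BPE merges where the base units are syllables (assumption), not characters.

    corpus: list of words, each given as a list of syllable strings.
    Returns the ordered list of merged syllable pairs.
    """
    words = Counter(tuple(word) for word in corpus)
    merges = []
    for _ in range(num_merges):
        # Count adjacent unit pairs, weighted by word frequency.
        pairs = Counter()
        for units, freq in words.items():
            for a, b in zip(units, units[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        # Most frequent pair wins; ties are broken by the pair itself for determinism.
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        # Apply the merge, accumulating counts in case two words become identical.
        merged_words = Counter()
        for units, freq in words.items():
            merged_words[merge_pair(units, best)] += freq
        words = merged_words
    return merges


def tokenize(syllables, merges):
    """Apply learned merges, in order, to one syllabified word."""
    units = tuple(syllables)
    for pair in merges:
        units = merge_pair(units, pair)
    return list(units)
```

As a usage illustration with placeholder syllables: learning one merge from `[["ma", "la", "yaa", "lam"], ["ma", "la"], ["ma", "la", "kal"]]` selects the pair `("ma", "la")`, after which `tokenize(["ma", "la", "ram"], merges)` yields `["mala", "ram"]`. Because merges never cross a syllable boundary's interior, every resulting token is a whole number of syllables, which is the property the syllable-first design is meant to guarantee in this sketch.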

Cited by 2 publications
References 48 publications