2021
DOI: 10.1007/978-3-030-87034-8_9
|View full text |Cite
|
Sign up to set email alerts
|

Improvement of Intent Classification Using Diacritic Restoration for Text Message in Chatbot

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 16 publications
0
4
0
Order By: Relevance
“…The statistical language model can be created not only on the word level but on the character level, as in [21]. During the first stage, for recognized words, it uses a statistical n-gram language model with n = [1,4] that works on the word level; during the second stage, it processes the out-of-vocabulary words with the statistical n-gram character-based model that works on the character level. The authors proved that their offered approach led to the better diacritization accuracy of the Arabic dialectal texts.…”
Section: Statistics-based Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…The statistical language model can be created not only on the word level but on the character level, as in [21]. During the first stage, for recognized words, it uses a statistical n-gram language model with n = [1,4] that works on the word level; during the second stage, it processes the out-of-vocabulary words with the statistical n-gram character-based model that works on the character level. The authors proved that their offered approach led to the better diacritization accuracy of the Arabic dialectal texts.…”
Section: Statistics-based Approachesmentioning
confidence: 99%
“…The diacritics restoration also resulted in a better text-to-speech performance for Romanians [3]. Used as the integrative NLU component, the diacritics restoration also improved the accuracy of the intent classification-based Vietnamese dialogue system [4,5]. Similarly, statistical machine translation performance was positively correlated with correctly diacritized words for Arabic [6].…”
Section: Introductionmentioning
confidence: 99%
“…Diacritics restoration also resulted in better text-to-speech performance for Romanian [3]. Used as the integrative NLU component, diacritics restoration also improved the accuracy of the intent classification-based Vietnamese dialogue system [4,5]. Besides, statistical machine translation performance was positively correlated with correctly diacritized words for Arabic [6].…”
Section: Introductionmentioning
confidence: 99%
“…The statistical language model can be created not only on the word level but on the character level as in [21]. During the first stage, for recognized words it uses a statistical n-gram language model with n = [1,4] that works on the word level; during the second stage, it processes out-of-vocabulary words with the statistical n-gram character-based model that works on the character level. The authors prove that their offered approach leads to the better diacritization accuracy of the Arabic dialectal texts.…”
Section: Introductionmentioning
confidence: 99%

Correcting diacritics and typos with a ByT5 transformer model

Stankevičius,
Lukoševičius,
Kapočiūtė-Dzikienė
et al. 2022
Preprint