2017
DOI: 10.3390/info8040157
|View full text |Cite
|
Sign up to set email alerts
|

Bidirectional Long Short-Term Memory Network with a Conditional Random Field Layer for Uyghur Part-Of-Speech Tagging

Abstract: Abstract:Uyghur is an agglutinative and a morphologically rich language; natural language processing tasks in Uyghur can be a challenge. Word morphology is important in Uyghur part-of-speech (POS) tagging. However, POS tagging performance suffers from error propagation of morphological analyzers. To address this problem, we propose a few models for POS tagging: conditional random fields (CRF), long short-term memory (LSTM), bidirectional LSTM networks (BI-LSTM), LSTM networks with a CRF layer, and BI-LSTM netw… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
7
2

Relationship

2
7

Authors

Journals

citations
Cited by 14 publications
(7 citation statements)
references
References 6 publications
(5 reference statements)
0
6
0
1
Order By: Relevance
“…The above methods [4,[20][21][22] improve the accuracy of POS tagging by enriching the input information. To further improve the accuracy of POS tagging, some works [26][27][28] combine Bi-LSTM with CRF since CRF can learn sentence-level tag information. In addition to CRF, Bi-LSTM can be integrated with adversarial neural networks to extract better features [29,30], which also can improve the accuracy.…”
Section: Pos Taggingmentioning
confidence: 99%
“…The above methods [4,[20][21][22] improve the accuracy of POS tagging by enriching the input information. To further improve the accuracy of POS tagging, some works [26][27][28] combine Bi-LSTM with CRF since CRF can learn sentence-level tag information. In addition to CRF, Bi-LSTM can be integrated with adversarial neural networks to extract better features [29,30], which also can improve the accuracy.…”
Section: Pos Taggingmentioning
confidence: 99%
“…The inherent syllable structure is (initial sound) + nucleus sound + (final sound), where the nucleus sound must be a vowel. There can be no initial sound or final sound in the syllable, but there must be a nucleus sound [26][27][28]. The present-day Uyghur is classified into 12 syllabic types.…”
Section: Syllables Of Uyghurmentioning
confidence: 99%
“…The authors of [36] constructed a contemporary Uyghur grammatical information dictionary that provided extensive grammatical information and collocation features and is presently a primary resource for NLP research specific to the Uyghur language. The Uyghur Dependency Treebank was built from a public reading corpus [37,38], and also presently serves as an important tool for Uyghur linguistic studies. A standard scheme for tagging Uyghur sentences to construct a typical knowledge graph is illustrated in Table 1.…”
Section: Related Workmentioning
confidence: 99%