2012
DOI: 10.15373/22501991/mar2014/11
|View full text |Cite
|
Sign up to set email alerts
|

Kashmir Part of Speech Tagger Using CRF

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 6 publications
0
4
0
Order By: Relevance
“…for which enormous data is available in digital form, Kashmiri language is data deficient. After exploring various resources (Trilingual (English-Hindi-Kashmiri) E-Dictionary (12) , Kashmiri WordNet (13) , dataset used in (14) and other resources), we managed a raw corpus comprising of about 500K tokens. The overall corpus contains text from different domains like Sports, culture, science etc.…”
Section: Raw Corpusmentioning
confidence: 99%
See 1 more Smart Citation
“…for which enormous data is available in digital form, Kashmiri language is data deficient. After exploring various resources (Trilingual (English-Hindi-Kashmiri) E-Dictionary (12) , Kashmiri WordNet (13) , dataset used in (14) and other resources), we managed a raw corpus comprising of about 500K tokens. The overall corpus contains text from different domains like Sports, culture, science etc.…”
Section: Raw Corpusmentioning
confidence: 99%
“…The overall corpus contains text from different domains like Sports, culture, science etc. Using PoS tagger created in research effort (14) thewhole corpus is PoS tagged with an accuracy of 94%.…”
Section: Raw Corpusmentioning
confidence: 99%
“…Kashmiri language mainly spoken by the people of the Kashmiri and is morphologically very rich but no dataset is available for research purpose which poses a great challenge in this study. Dataset used in this study is collected from Kashmiri WordNet, dataset used in [18] , Trilingual Sense Dictionary [19] . In addition, sentences are manually entered using keyboard.…”
Section: Data Collectionmentioning
confidence: 99%
“…However, by increasing the training data the POS-Tagger may result in better performance as was evident by its result summary. The system performance got raised from 67.22% to 81.10% by varying the training data size from 15000 to 27000 (27,28) .…”
Section: Part-of-speech (Pos) Taggingmentioning
confidence: 99%