2012
DOI: 10.5120/8111-1727
|View full text |Cite
|
Sign up to set email alerts
|

Part of Speech Tagging in Manipuri: A Rule based Approach

Abstract: The process of assigning morpho-syntactic categories of each morpheme including punctuation marks in a given text document according to the context is called Part of Speech (POS) tagging. In this paper we represent the rule-based Part of Speech Tagger of Manipuri by applying a set of hand written linguistic rules of Manipuri language. Nevertheless, it is very difficult to classify the lexical categories of Manipuri, an agglutinating Tibeto-Burman language of Northeast India. So, in this tagger we are using the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(10 citation statements)
references
References 7 publications
0
10
0
Order By: Relevance
“…The existing POS tagging systems for Manipuri languages along with the techniques adopted are given in the following table. A set of 3 types of rules-orthographic, morphological and disambiguation has been applied along with the use of a lexicon by [24]. A 3-tier tagset comprising of major category, sub-category and the attributes has been designed.…”
Section: Pos Taggingmentioning
confidence: 99%
“…The existing POS tagging systems for Manipuri languages along with the techniques adopted are given in the following table. A set of 3 types of rules-orthographic, morphological and disambiguation has been applied along with the use of a lexicon by [24]. A 3-tier tagset comprising of major category, sub-category and the attributes has been designed.…”
Section: Pos Taggingmentioning
confidence: 99%
“…However, in this way, the particularities of this kind of communication are removed, and not studied. Other procedures experienced in adapting a POS-tagger for noisy texts are described in the papers [8,9,17].…”
Section: Related Workmentioning
confidence: 99%
“…Jeff et al [25] added the term to the class similarity computation, tending to have a higher priority for smaller classes to be merged. In our experiments we set ≈ 0.…”
Section: Developing Part-of-speech Set Methodsmentioning
confidence: 99%
“…Mart [23] used 47 tags to build a Spanish treebank in Spanish. For developed part-of-speech tagger, Avontuur et al [24] used 25 tags for Dutch, Singha et al [25] used 97 tags for Manipuri, Neunerdt et al [26] used 54 tags for German.…”
Section: Introductionmentioning
confidence: 99%