Iti Mathur scite author profile

Iti Mathur

5 Publications

58 Citation Statements Received

15 Citation Statements Given

Affiliations

Banasthali University

Publications

Development of Marathi part of speech tagger using statistical approach

Singh

Joshi

Mathur

2013

Part-of-speech (POS) tagging is the process of assigning each word in a text its corresponding part of speech. In its most basic form, POS tagging identifies words as nouns, verbs, adjectives, etc. It is a prominent tool in natural language processing, and statistical models for it are among the simplest and most consistent used in NLP applications. POS tagging is an initial stage of text analysis for tasks such as information retrieval, machine translation, text-to-speech synthesis, and information extraction. Various approaches have been proposed to implement POS taggers. In this paper we present a part-of-speech tagger for Marathi, a morphologically rich language spoken by the native people of Maharashtra. The tagger follows a statistical approach using Unigram, Bigram, Trigram, and HMM methods. The paper explains each algorithm with suitable examples and introduces a tag set that can be used for tagging Marathi text. We describe the development of the taggers and compare the accuracy of their output. The four Marathi POS taggers, viz. Unigram, Bigram, Trigram, and HMM, achieve accuracies of 77.38%, 90.30%, 91.46%, and 93.82%, respectively.
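
As a rough illustration of the HMM method named in this abstract, the following minimal Python sketch trains a bigram HMM from tagged sentences and decodes with the Viterbi algorithm. It is not the authors' implementation: the toy English corpus, the tag names, the `START` marker, and the add-one smoothing are all assumptions made for the example, and the paper's Marathi tag set and data are not reproduced here.

```python
import math
from collections import Counter, defaultdict

START = "<s>"  # hypothetical sentence-start marker (assumption)

# Toy English stand-in corpus (assumption; not the paper's Marathi data).
TRAIN = [
    [("the", "DET"), ("dog", "NOUN"), ("runs", "VERB")],
    [("a", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    [("the", "DET"), ("cat", "NOUN"), ("runs", "VERB")],
]

def train_hmm(tagged_sentences):
    """Count tag-to-tag transitions and tag-to-word emissions."""
    trans = defaultdict(Counter)  # previous tag -> counts of next tag
    emit = defaultdict(Counter)   # tag -> counts of words
    for sentence in tagged_sentences:
        prev = START
        for word, tag in sentence:
            trans[prev][tag] += 1
            emit[tag][word] += 1
            prev = tag
    return trans, emit

def log_prob(counts, key, n_outcomes):
    """Add-one smoothed log-probability of `key` under `counts`."""
    return math.log((counts[key] + 1) / (sum(counts.values()) + n_outcomes))

def viterbi(words, trans, emit):
    """Return the most probable tag sequence for `words`."""
    if not words:
        return []
    tags = sorted(emit)
    # Vocabulary size plus one slot reserved for unseen words.
    n_words = len({w for counts in emit.values() for w in counts}) + 1
    columns = []  # columns[i][tag] = (best log-score, backpointer)
    for i, word in enumerate(words):
        column = {}
        for tag in tags:
            if i == 0:
                score, back = log_prob(trans[START], tag, len(tags)), None
            else:
                score, back = max(
                    (columns[-1][p][0] + log_prob(trans[p], tag, len(tags)), p)
                    for p in tags
                )
            column[tag] = (score + log_prob(emit[tag], word, n_words), back)
        columns.append(column)
    # Follow backpointers from the best final tag.
    tag = max(tags, key=lambda t: columns[-1][t][0])
    path = [tag]
    for column in reversed(columns[1:]):
        tag = column[tag][1]
        path.append(tag)
    return path[::-1]

trans, emit = train_hmm(TRAIN)
print(viterbi(["a", "dog", "sleeps"], trans, emit))  # ['DET', 'NOUN', 'VERB']
```

Working in log space avoids numeric underflow on longer sentences, and the smoothing keeps unseen words and unseen tag pairs from receiving zero probability.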

Named Entity Recognition in Hindi Using Hidden Markov Model

Chopra

Joshi

Mathur

2016

Shiva: A Framework for Graph based Ontology Matching

Mathur

Joshi

Darbari et al.

2014

IJCA

For a long time, corporations have been looking for knowledge sources that provide a structured description of data and focus on meaning and shared understanding: structures that support open-world assumptions, are flexible enough to recognize more than one name for an entity, and whose major purpose is to facilitate human communication and interoperability. Databases fail to provide these features, and ontologies have emerged as an alternative. However, corporations working in the same domain tend to build different ontologies, and a problem arises when they want to share their data or knowledge. Tools are therefore needed to merge ontologies into one; this task is termed ontology matching. It is an emerging area, and an ideal matcher that produces good results is still a long way off. In this paper we present a framework for matching ontologies using graphs.

Rule based stemmer in Urdu

Gupta

Joshi

Mathur

2013

Urdu draws on several languages, including Arabic, Hindi, English, Turkish, and Sanskrit, and has a complex and rich morphology. This is why little work has been done in Urdu language processing. Stemming converts a word into its root form by separating the suffix and prefix from the word. It is useful in search engines, natural language processing, word processing, spell checkers, word parsing, and word frequency and count studies. This paper presents a rule-based stemmer for Urdu intended for use in information retrieval. We evaluated the results by having them verified by a human expert.
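
The affix-stripping idea described in this abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function `stem`, the `min_stem_len` guard, and the short romanized affix lists are hypothetical placeholders chosen for the example, not the rule set from the paper, which works on Urdu script and is not reproduced here.

```python
# Illustrative romanized affixes (assumption; not the paper's rules).
PREFIXES = ["be", "na"]
SUFFIXES = ["on", "en"]

def stem(word, prefixes=PREFIXES, suffixes=SUFFIXES, min_stem_len=3):
    """Strip at most one prefix and one suffix, trying longer affixes first.

    An affix is removed only if at least `min_stem_len` characters remain,
    which guards against stripping short words down to nothing.
    """
    for prefix in sorted(prefixes, key=len, reverse=True):
        if word.startswith(prefix) and len(word) - len(prefix) >= min_stem_len:
            word = word[len(prefix):]
            break
    for suffix in sorted(suffixes, key=len, reverse=True):
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem_len:
            word = word[:-len(suffix)]
            break
    return word

print(stem("kitaben"))  # kitab
print(stem("bewafa"))   # wafa
print(stem("naam"))     # naam (too short to strip "na")
```

A real rule-based stemmer would also need exception lists for words that merely look affixed, which is one reason the abstract reports verification by a human expert.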

Part of Speech Tagging of Marathi Text Using Trigram Method

Singh

Joshi

Mathur

2013

IJAIT

In this paper we present a Marathi part-of-speech tagger. Marathi is a morphologically rich language spoken by the native people of Maharashtra. The tagger follows a statistical approach using the Trigram method. The main idea of the Trigram method is to find the most likely POS tag for a token given the previous two tags, calculating probabilities to determine the best tag sequence. We describe the development of the tagger and its evaluation.
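
The trigram idea in this abstract, choosing a tag from the two tags before it, can be sketched as follows. This is a simplified sketch, not the paper's tagger: it decodes greedily from left to right rather than searching over whole tag sequences, and the toy English corpus, tag names, `START` marker, and add-one smoothing are assumptions made for illustration.

```python
from collections import Counter, defaultdict

START = "<s>"  # hypothetical padding tag for the sentence start (assumption)

# Toy English stand-in corpus (assumption; not the paper's Marathi data).
TRAIN = [
    [("the", "DET"), ("dog", "NOUN"), ("runs", "VERB")],
    [("a", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    [("the", "DET"), ("cat", "NOUN"), ("runs", "VERB")],
]

def train_trigram(tagged_sentences):
    """Count tag trigrams and word emissions from tagged sentences."""
    transitions = defaultdict(Counter)  # (tag[i-2], tag[i-1]) -> counts of tag[i]
    emissions = defaultdict(Counter)    # tag -> counts of words
    for sentence in tagged_sentences:
        prev2, prev1 = START, START
        for word, tag in sentence:
            transitions[(prev2, prev1)][tag] += 1
            emissions[tag][word] += 1
            prev2, prev1 = prev1, tag
    return transitions, emissions

def tag_sentence(words, transitions, emissions):
    """Greedily pick, for each word, the tag maximizing
    P(tag | previous two tags) * P(word | tag), with add-one smoothing."""
    tag_names = sorted(emissions)
    vocab_size = len({w for counts in emissions.values() for w in counts})
    tags = []
    prev2, prev1 = START, START
    for word in words:
        context = transitions.get((prev2, prev1), Counter())
        context_total = sum(context.values())
        best_tag, best_score = None, -1.0
        for tag in tag_names:
            word_counts = emissions[tag]
            p_trans = (context[tag] + 1) / (context_total + len(tag_names))
            p_emit = (word_counts[word] + 1) / (sum(word_counts.values()) + vocab_size)
            score = p_trans * p_emit
            if score > best_score:
                best_tag, best_score = tag, score
        tags.append(best_tag)
        prev2, prev1 = prev1, best_tag
    return tags

transitions, emissions = train_trigram(TRAIN)
print(tag_sentence(["a", "dog", "sleeps"], transitions, emissions))
# ['DET', 'NOUN', 'VERB']
```

Because the tag context carries information even when a word was never seen in training, an input such as `["the", "bird", "runs"]` is still tagged `DET NOUN VERB` in this toy setup.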
