Nava Ehsan scite author profile

SUMMARY Producing electronic rather than paper documents has considerable benefits, such as easier organization and data management. Automatic writing assistance tools such as spelling and grammar checkers and correctors can therefore increase the quality of electronic texts by removing noise and correcting erroneous sentences. Errors in a text can be categorized as spelling, grammatical, and real‐word errors. In this article, we present a language‐independent approach based on a statistical machine translation framework to develop a proofreading tool that detects grammatical errors as well as context‐sensitive spelling mistakes (real‐word errors). A hybrid model for grammar checking is suggested by combining this approach with an existing rule‐based grammar checker. Experimental results on both English and Persian indicate that the proposed statistical method and the rule‐based grammar checker are complementary in detecting and correcting syntactic errors. The results of the hybrid grammar checker, applied to some English texts, show an improvement of about 24% in recall with nearly the same precision. Experiments on a real‐world data set show that state‐of‐the‐art results are achieved for grammar checking and context‐sensitive spell checking for Persian. Copyright © 2012 John Wiley & Sons, Ltd.

Towards grammar checker development for Persian language

Ehsan

Faili

2010

Using a Dictionary and n-gram Alignment to Improve Fine-grained Cross-Language Plagiarism Detection

Ehsan

Tompa

Shakery

2016

The Web offers fast and easy access to a wide range of documents in various languages, and translation and editing tools make it fairly easy to create derivative documents. This creates a need for effective tools for detecting cross-language plagiarism. Given a suspicious document, cross-language plagiarism detection comprises two main subtasks: retrieving documents that are candidate sources for that document, and analyzing those candidates one by one to determine their similarity to the suspicious document. In this paper we focus on the second subtask and introduce a novel approach for assessing cross-language similarity between texts in order to detect plagiarized cases. Our proposed approach has two main steps: a vector-based retrieval framework that focuses on high recall, followed by a more precise similarity analysis based on dynamic text alignment. Experiments show that our method outperforms the best-performing methods from PAN-2012 and PAN-2014 in terms of plagdet score. We also show that aligning n-gram units, instead of aligning complete sentences, improves the accuracy of plagiarism detection.

CCS Concepts: • Information systems → Near-duplicate and plagiarism detection; Dictionaries; Multilingual and cross-lingual retrieval; Digital libraries and archives; • Applied computing → Language translation; Document analysis

Vafa spell-checker for detecting spelling, grammatical, and real-word errors of Persian language

Faili

Ehsan

Montazery

et al. 2014

Digital Scholarship in the Humanities

Nava Ehsan

Candidate document retrieval for cross-lingual plagiarism detection using two-level proximity information

Grammatical and context‐sensitive error correction using a statistical machine translation framework
