Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) 2014
DOI: 10.3115/v1/p14-2028
|View full text |Cite
|
Sign up to set email alerts
|

Improved Iterative Correction for Distant Spelling Errors

Abstract: Noisy channel models, widely used in modern spellers, cope with typical misspellings, but do not work well with infrequent and difficult spelling errors. In this paper, we have improved the noisy channel approach by iterative stochastic search for the best correction. The proposed algorithm allowed us to avoid local minima problem and improve the F 1 measure by 6.6% on distant spelling errors.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
3
0

Year Published

2015
2015
2019
2019

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 7 publications
1
3
0
Order By: Relevance
“…We also observe that iterative correction, i.e. running the speller a second time over already corrected data, further improves performance slightly which is in-line with findings in (Cucerzan and Brill, 2004) and (Gubanov et al, 2014).…”
Section: Hillclimbing and Performancesupporting
confidence: 90%
“…We also observe that iterative correction, i.e. running the speller a second time over already corrected data, further improves performance slightly which is in-line with findings in (Cucerzan and Brill, 2004) and (Gubanov et al, 2014).…”
Section: Hillclimbing and Performancesupporting
confidence: 90%
“…Farra et al (2014) suggest a context-sensitive characterlevel spelling error correction model. Gubanov et al (2014) improve the Cucerzan and Brill (2004) model by iterating the application of the basic noisy channel model for spelling correction in a stochastic manner.…”
Section: Introductionmentioning
confidence: 99%
“…Automatic spelling correction in non-malicious settings has had a long research track as an important component of text processing and normalization. Early works have used edit distance to find morphologically similar corrections (Ristad and Yianilos, 1998), noisy channel model for misspellings (Jurafsky and Martin, 2014), and iterative search to improve corrections of distant spelling errors (Gubanov et al, 2014). Word contexts have been shown to be improve the robustness of spell checkers with ngram language model as one approach to incorporate contextual information (Hassan and Menezes, 2013;Farra et al, 2014).…”
Section: Related Workmentioning
confidence: 99%