2021
DOI: 10.1109/access.2021.3119627
|View full text |Cite
|
Sign up to set email alerts
|

Development of Bangla Spell and Grammar Checkers: Resource Creation and Evaluation

Abstract: A spell and grammar checker is profoundly essential for diverse publications especially for Bangla language in particular as it is spoken by millions of native speakers around the world. Considering the lack of research efforts, we demonstrate the development of a comprehensive Bangla spell and grammar checker with necessary resources. At first, a full-fledged and generalised Bangla monolingual corpus comprising over 100 million words has been built by scraping reputed, diversified online sources and then an e… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3
1

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(6 citation statements)
references
References 26 publications
0
6
0
Order By: Relevance
“…This process took more than 1 month. Moreover, we have taken nearly 2 million sentences from the reputed Bengali monolingual corpus NHMono01 [26] in the newspaper domain. The complete newspaper dataset formation process is shown in Figure 4.…”
Section: Newspaper Domainmentioning
confidence: 99%
“…This process took more than 1 month. Moreover, we have taken nearly 2 million sentences from the reputed Bengali monolingual corpus NHMono01 [26] in the newspaper domain. The complete newspaper dataset formation process is shown in Figure 4.…”
Section: Newspaper Domainmentioning
confidence: 99%
“…The double metaphone encoding table [16], [17] has been implemented as the encoder. Based on the probable phonetics of the expression, each Bangla letter has been encoded with one or more English letters.…”
Section: Double Metaphone Encodingmentioning
confidence: 99%
“…In recent years, a few Bangla language researchers have demonstrated a keen interest in Bangla corpus development and Bangla text analysis. Some notable progress has been documented in corpus creation in [15], [16], [17], [18], [19], [20] and knowledge engineering on Bangla language in [21], [22], [23], [24], etc. The first electronic Bangla corpus was constructed by the Central Institute of Indian Languages (CIIL) from 1991 to 1995 [25].…”
Section: Bangla Language Corpusmentioning
confidence: 99%
“…They have also presented several results of the study of features of the Bangla language based on empirical analysis of the corpus. In [20] a new corpus NHMono01 consisting of 100,142,522 tokens was developed. Their developed spell and grammar checker application of the corpus demonstrated some excellent results.…”
Section: Bangla Language Corpusmentioning
confidence: 99%