2023
DOI: 10.1038/s41598-023-47295-2
|View full text |Cite
|
Sign up to set email alerts
|

Correcting spelling mistakes in Persian texts with rules and deep learning methods

Sa. Kasmaiee,
Si. Kasmaiee,
M. Homayounpour

Abstract: This study aims to develop a system for automatically correcting spelling errors in Persian texts using two approaches: one that relies on rules and a common spelling mistake list and another that uses a deep neural network. The list of 700 common misspellings was compiled, and a database of 55,000 common Persian words was used to identify spelling errors in the rule-based approach. 112 rules were implemented for spelling correction, each providing suggested words for misspelled words. 2500 sentences were used… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 15 publications
(4 citation statements)
references
References 63 publications
0
1
0
Order By: Relevance
“…Artificial neural networks (ANNs), as introduced by McCulloch and Pitts [69], emerged as a result of studying brain functionality and subsequently found application in computer programs [6,70,71]. In addition, it is important to note that any ANN comprises numerous individual units, commonly referred to as neurons or processing elements (PE).…”
Section: Classification Viaartificial Neural Network (Ann)mentioning
confidence: 99%
See 1 more Smart Citation
“…Artificial neural networks (ANNs), as introduced by McCulloch and Pitts [69], emerged as a result of studying brain functionality and subsequently found application in computer programs [6,70,71]. In addition, it is important to note that any ANN comprises numerous individual units, commonly referred to as neurons or processing elements (PE).…”
Section: Classification Viaartificial Neural Network (Ann)mentioning
confidence: 99%
“…G − Mean Sensitivity × Specificity (5) where TP: True positive, FP: false positive, TN: true negative, and FN: false negative are shown [8,71,74,75].…”
Section: Performance Metricsmentioning
confidence: 99%
“…The proposed model consists of a hybrid of CNNs and LSTM. Hybrid CNN and LSTM model is widely used in many fields [29,38]. After setting parameters and preparing the dataset with the designed simulator, features are extracted using CNN and LSTM.…”
Section: Proposed Mobile Molecular Communication (Mmc) Modelmentioning
confidence: 99%
“…We propose a self-adaptive speech feature fusion network SSFF-Net that dynamically integrates enhanced features with original noisy features, fully integrating enhanced features and original speech information. In recent years, there has been a number of research into textual spelling error correction, which has used methods including rule-based and deep learning-based methods [12]. Error correction techniques have been widely used to refine the sentences output of ASR models [13]- [14], including error detection and correction.…”
Section: Introductionmentioning
confidence: 99%