2018
DOI: 10.14569/ijacsa.2018.090953
|View full text |Cite
|
Sign up to set email alerts
|

Printed Arabic Script Recognition: A Survey

Abstract: Optical character recognition (OCR) is essential in various real-world applications, such as digitizing learning resources to assist visually impaired people and transforming printed resources into electronic media. However, the development of OCR for printed Arabic script is a challenging task. These challenges are due to the specific characteristics of Arabic script. Therefore, different methods have been proposed for developing Arabic OCR systems, and this paper aims to provide a comprehensive review of the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 103 publications
0
4
0
Order By: Relevance
“…Alghamdi and Teahan [39] discussed the most commonly used datasets for training and evaluation of OCR systems for printed Arabic script, including the IFN/ENIT Arabic handwritten dataset, the "Handwriting Arabic Corpus" (HAC) dataset, and the RIMES dataset containing a large collection of printed and handwritten documents. The authors provide an overview of the available datasets and emphasize the importance of high-quality datasets for improving the accuracy of OCR systems.…”
Section: Datasetmentioning
confidence: 99%
“…Alghamdi and Teahan [39] discussed the most commonly used datasets for training and evaluation of OCR systems for printed Arabic script, including the IFN/ENIT Arabic handwritten dataset, the "Handwriting Arabic Corpus" (HAC) dataset, and the RIMES dataset containing a large collection of printed and handwritten documents. The authors provide an overview of the available datasets and emphasize the importance of high-quality datasets for improving the accuracy of OCR systems.…”
Section: Datasetmentioning
confidence: 99%
“…Additionally, natural languages that use the Arabic writing system extends the base alphabets by adding special diacritics over some characters to better adapt the writing system to the phonemes of the designated language. A thorough discussing about these challenges can be found in [1]. All these characteristics make the recognition of Arabic text a challenging task, especially for the models that depend on segmenting characters prior to the recognition process [2].…”
Section: Challenges Related To Arabic Text Recognitionmentioning
confidence: 99%
“…Further, the convolution process in the model employed zero padding so that it can preserve the size of the input image throughout the convolution process. The pooling process in the initial two layers used a sliding window of size (2x2) while the remaining three layers used a window of size (1,2).…”
Section: Proposed Modelmentioning
confidence: 99%
“…As stated in [17], many works were introduced that utilizes fuzzy logic within Arabic OCR applications. In [18], some of these approaches, features are modeled by fuzzy linguistic variables, and fuzzy rules are then used for classification.…”
Section: Related Workmentioning
confidence: 99%