Proceedings of the 1st International Conference on Informatics, Engineering, Science and Technology, INCITES 2019
DOI: 10.4108/eai.18-7-2019.2287842

A Quranic Dataset for Text Recognition

Abstract: Any text recognition or Optical Character Recognition (OCR) system requires a dataset from which to learn how to recognize text. Due to the lack of a standard benchmark, most studies in this field were conducted on private datasets, which prevents a fair comparison. In this work, we used the standard Mushaf al Madinah benchmark, whose writing style follows certain rules; for example, each page should start at the beginning of a verse and end at the end of a verse. Following these rules makes the words vary in size a…

Citations: Cited by 2 publications (1 citation statement)
References: References 6 publications
“…The text recognition system must recognise the whole word, but the Arabic language is a cursive language where the characters are connected to construct a word or subword. Thus, [26,27] introduced the subword dataset where they built the Arabic word from more than one group, such as the word Quran (قران).…”
Section: Introduction
Type: mentioning
Confidence: 99%