2020
DOI: 10.1109/access.2020.2994214
|View full text |Cite
|
Sign up to set email alerts
|

Urdu-Text Detection and Recognition in Natural Scene Images Using Deep Learning

Abstract: Urdu text is a cursive script and belongs to a non-Latin family of other cursive scripts like Arabic, Chinese, and Hindi. Urdu text poses a challenge for detection/localization from natural scene images, and consequently recognition of individual ligatures in scene images. In this paper, a methodology is proposed that covers detection, orientation prediction, and recognition of Urdu ligatures in outdoor images. As a first step, the custom FasterRCNN algorithm has been used in conjunction with well-known CNNs l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
24
0
1

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 52 publications
(26 citation statements)
references
References 53 publications
0
24
0
1
Order By: Relevance
“…To compare the outcomes, Daniyal et al presented the Bluechet and Puchserver models. Arafat et al [81] introduce Hijja, a novel dataset of Arabic letters produced only by youngsters aged 7-12. A total of 591 individuals contributed 47,434 characters to our dataset.…”
Section: Literature Reviewmentioning
confidence: 99%
“…To compare the outcomes, Daniyal et al presented the Bluechet and Puchserver models. Arafat et al [81] introduce Hijja, a novel dataset of Arabic letters produced only by youngsters aged 7-12. A total of 591 individuals contributed 47,434 characters to our dataset.…”
Section: Literature Reviewmentioning
confidence: 99%
“…They have been adapted for feature extraction in many text recognition systems. We can cite for example: scene text recognition [12], [13], video text recognition [14], and offline handwriting text recognition [15]- [17]. However, CNN-based or DL-based approaches are still deficient.…”
Section: ) Feature Extractionmentioning
confidence: 99%
“…The deep learning models of non‐spotting in natural and video images can be classified further into regression/anchor based models (Arafat & Iqbal, 2020; Chandio & Pickering, 2019; Huang & Xu, 2019) and segmentation based models (Cai et al, 2020; Dai et al, 2020; Guo et al, 2020). The former class considers the whole text as an object for detection while the latter class considers merging pixel by pixel or character by character for text detection in natural and video images.…”
Section: Non‐spotting Based Mining Approachesmentioning
confidence: 99%