2020
DOI: 10.1016/j.dib.2020.105749
|View full text |Cite
|
Sign up to set email alerts
|

Cursive-Text: A Comprehensive Dataset for End-to-End Urdu Text Recognition in Natural Scene Images

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(9 citation statements)
references
References 3 publications
0
9
0
Order By: Relevance
“…Figure 5 representing that which dataset is the most used and which is the least used and as per the resulting signal based EMG data has only used twice in Khan et al (2020b) and Khan et al (2020a) whereas dataset which contains the images of sign used four times in Halim & Abbas (2014) , Kanwal et al (2014) , Nasir et al (2014) and Imran et al (2021a) . The most used type of dataset are the character-based datasets in Chandio et al (2020) , Naseem et al (2019) , Sagheer et al (2010) , Ahmad et al (2017) , Sami, (2014) , Husnain et al (2019) , Gul et al (2020) Arafat & Iqbal (2020) and Ahmed et al (2017) . Also, in Fig.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…Figure 5 representing that which dataset is the most used and which is the least used and as per the resulting signal based EMG data has only used twice in Khan et al (2020b) and Khan et al (2020a) whereas dataset which contains the images of sign used four times in Halim & Abbas (2014) , Kanwal et al (2014) , Nasir et al (2014) and Imran et al (2021a) . The most used type of dataset are the character-based datasets in Chandio et al (2020) , Naseem et al (2019) , Sagheer et al (2010) , Ahmad et al (2017) , Sami, (2014) , Husnain et al (2019) , Gul et al (2020) Arafat & Iqbal (2020) and Ahmed et al (2017) . Also, in Fig.…”
Section: Resultsmentioning
confidence: 99%
“…Also, in Fig. 6 , we can observe that only five datasets are publically available out of four datasets ( Chandio et al, 2020 ; Sagheer et al, 2010 ; Arafat & Iqbal, 2020 ; Ahmed et al, 2017 ) contain either character, numeral, or sentence-based images. Only one dataset ( Liberati et al, 2009 ) is publically available, which is based on images of the visually impaired individual making signs of Urdu Language and the highest accuracy reported in publically available datasets is 97% by Chandio et al (2020) , which is reported on the text-based dataset and not on sign based dataset which is the actual lackness in this area.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…However, the text in news ticker images generally appears at either bottom or top on images, which makes the text localization task easier. The largest Urdu scene text dataset is presented in [39], where author has collected 2500 natural outdoor images with three different languages text, Urdu, English and Sindhi. This dataset is further processed to get cropped isolated characters and word dataset.…”
Section: A Datasets Of Text In Natural Imagementioning
confidence: 99%
“…Second, datasets for scene text localization and recognition are often limited to Latin script, which is considerably easier to process compared to cursive scripts (e.g. Japanese, Chinese, Persian and Arabic) [12], [13]. Moreover, these datasets are occasionally collected in a transportation context, which is again discouraging for road safety researchers.…”
Section: Introductionmentioning
confidence: 99%