Open Datasets and Tools for Arabic Text Detection and Recognition in News Video Frames

Zayene, Oussama; Touj, Sameh Masmoudi; Hennebert, Jean; Ingold, Rolf; Amara, Najoua Essoukri Ben

doi:10.3390/jimaging4020032

Cited by 14 publications

(5 citation statements)

References 48 publications

“… | Type of Content | Availability | Size of Dataset
ACTIV2 [12] | Embedded words | Public | 10,415 text images
QTID [13] | Synthetic words | Private | 309,720 words and 249,428 characters
IFN/ENIT [14] | Handwritten words | Public | 115,000 words and 212,000 characters
AHDB [15] | Handwritten words and digits | Private | 30,000 words
APTI [16] | Printed words | Public | 113,284 words and 648,280 characters
HACDB [17] | Handwritten characters | Public | 6600 characters and 50 writers
UPTI [18] | Printed text lines | Public | 10,000 text lines
Digital Jawi [19] | Jawi paleography images | Public | 168 words and 1524 characters
KHATT [20] | Handwritten text lines | Public | 9327 lines, 165,890 words and 589,924 characters
ALIF [21] | Embedded text lines | Upon request | 1804 words and 89,819 characters
ACTIV [22] | Embedded text lines | Public | 4824 lines and 21,520 words
SmartATID [23] | Printed and handwritten pages | Public | 9088 pages
Degraded historical [24] | Handwritten documents | Public | 10 handwritten images and 10 printed images
Printed PAW [25] | Printed subwords | Upon request | 415,280 unique words and 550,000 sub words
Checks [26] | Handwritten subwords and digits | Private | 29,498 subwords and 15,148 digits
Numeral [27] | Handwritten digits | Public | 21,120 digits and 44 writers
Forms [28] | Handwritten characters | Private | 15,800 characters and 500 writers
KAFD [29] | Printed pages and lines | Public | 28,767 pages and 644,006 lines
AHDBIFTR [30] | Handwritten images | Public | 497 word images and 5 writers
ARABASE [31] | Handwritten text | Public | 47,000 words and 500 free Arabic sentences
CEDAR [32] | Handwritten pages | Private | 20,000 words, 10 writers, and 100 documents
CENPARMI [26] | Handwritten subwords and digits | Public | 6000 digit images

Shafi and Zia [33] surveyed automatic Urdu text recognition techniques and described the algorithms, techniques, datasets, challenges, and future directions for Urdu OCR. Additionally, [34] reviewed the availability of datasets and suggested more training data to address the unique challenges of OCR systems.…”

Section: Dataset (mentioning)

confidence: 99%

A Survey of OCR in Arabic Language: Applications, Techniques, and Challenges

et al. 2023

Optical character recognition (OCR) is the process of extracting handwritten or printed text from a scanned or printed image and converting it to a machine-readable form for further data processing, such as searching or editing. Automatic text extraction using OCR helps to digitize documents for improved productivity and accessibility and for preservation of historical documents. This paper provides a survey of the current state-of-the-art applications, techniques, and challenges in Arabic OCR. We present the existing methods for each step of the complete OCR process to identify the best-performing approach for improved results. This paper follows the keyword-search method for reviewing the articles related to Arabic OCR, including the backward and forward citations of each article. In addition to state-of-the-art techniques, this paper identifies research gaps and presents future directions for Arabic OCR.

“…The ACTIV 2.0 Dataset (Zayene et al. 2018a) is a public dataset that was extracted from 189 video clips, and produces 4,063 key-frames for detection and 10,415 cropped text images for recognition. This dataset is distributed with open-source tools for annotation and evaluation.…”

Section: Arabic Optical Character Recognition Dataset (mentioning)

confidence: 99%

A Review of Arabic Text Recognition Dataset

Al-Sheikh, Mohd, Warlina

2020

APJITM

Building a robust Optical Character Recognition (OCR) system for languages with cursive scripts, such as Arabic, has always been challenging. These challenges increase if the text contains diacritics of different sizes for characters and words. Apart from the complexity of the font used, these challenges must be addressed in recognizing the text of the Holy Quran. To solve these challenges, an OCR system has to go through different phases. Each problem has to be addressed using a different approach; thus, researchers are studying these challenges and proposing various solutions. This has motivated this study to review Arabic OCR datasets, because the dataset plays a major role in determining the nature of an OCR system. State-of-the-art approaches in segmentation and recognition are found to use Recurrent Neural Networks (Long Short-Term Memory, LSTM, and Gated Recurrent Unit, GRU) with Connectionist Temporal Classification (CTC). This also includes deep learning models and implementations of GRU in the Arabic domain. This paper contributes a profile of Arabic text recognition datasets, thereby characterizing the nature of the OCR systems developed, and identifies research directions for building Arabic text recognition datasets.

“…Various efforts have been reported for capturing and preparing the datasets for Arabic text in natural images in the recent past. Some articles presented a survey on available open access datasets and tools specifically designed for Arabic text detection and recognition in video frames captured by news channels [76]. The benchmark dataset for Arabic scene text still requires more effort to standardize the research as far as Arabic scene text analysis is concerned.…”

Section: Arabic Scene Text Datasets (mentioning)

confidence: 99%

Arabic Cursive Text Recognition from Natural Scene Images

Ahmed, Naz, Razzak et al. 2019

Applied Sciences

This paper presents a comprehensive survey on Arabic cursive scene text recognition. Recent publications in this field show a shift in the interest of document image analysis researchers from recognition of optical characters to recognition of characters appearing in natural images. Scene text recognition is a challenging problem because the text varies in font style, size, alignment, orientation, reflection, illumination, blurriness and background complexity. Among cursive scripts, Arabic scene text recognition is considered a more challenging problem due to joined writing, variations of the same character, a large number of ligatures, the number of baselines, etc. Surveys on Latin and Chinese script-based scene text recognition systems can be found, but the Arabic-like scene text recognition problem is yet to be addressed in detail. In this manuscript, a description is provided to highlight some of the latest techniques presented for text classification. The presented techniques, which follow a deep learning architecture, are equally suitable for the development of Arabic cursive scene text recognition systems. The issues pertaining to text localization and feature extraction are also presented. Moreover, this article emphasizes the importance of having a benchmark cursive scene text dataset. Based on the discussion, future directions are outlined, some of which may provide insight about cursive scene text to researchers.
