2019 International Conference on Document Analysis and Recognition Workshops (ICDARW) 2019
DOI: 10.1109/icdarw.2019.50110
|View full text |Cite
|
Sign up to set email alerts
|

Arabic Handwritten Documents Segmentation into Text-Lines and Words using Deep Learning

Abstract: One of the most important steps in a handwriting recognition system is text-line and word segmentation. But, this step is made difficult by the differences in handwriting styles, problems of skewness, overlapping and touching of text and the fluctuations of text-lines. It is even more difficult for ancient and calligraphic writings, as in Arabic manuscripts, due to the cursive connection in Arabic text, the erroneous position of diacritic marks, the presence of ascending and descending letters, etc. In this wo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 29 publications
(17 citation statements)
references
References 24 publications
0
10
0
Order By: Relevance
“…With deep learning approaches, semantic segmentation is widely used. In (Neche et al, 2019) the authors propose to segmentation of a historical document into text line by using the Residual U-Net architecture and classify pixels of the input image into three classes: background, paragraphs, and lines of text. In (Aïcha Gader and Echi, 2020) also use the modified U-Net architecture by integrating a recurrent residual convolutional neural network (RRCNN) with an attention mechanism called AR2U-Net to find precise features in a specific region, and the performance on BADAM dataset gives 93.7 % of F-measure.…”
Section: Text Line Extractionmentioning
confidence: 99%
See 1 more Smart Citation
“…With deep learning approaches, semantic segmentation is widely used. In (Neche et al, 2019) the authors propose to segmentation of a historical document into text line by using the Residual U-Net architecture and classify pixels of the input image into three classes: background, paragraphs, and lines of text. In (Aïcha Gader and Echi, 2020) also use the modified U-Net architecture by integrating a recurrent residual convolutional neural network (RRCNN) with an attention mechanism called AR2U-Net to find precise features in a specific region, and the performance on BADAM dataset gives 93.7 % of F-measure.…”
Section: Text Line Extractionmentioning
confidence: 99%
“…The authors in (Neche et al, 2019) use deep learning for word segmentation by putting the text line image into a CNN, which extracts the features that sequentially pass to the bidirectional long short-term memory (BLSTM) network followed by CTC function (Graves et al, 2006) in order to find the alignment sequence between words and spaces. The alignment result on the lines of KHATT dataset reaches 80.1% of the F-measure.…”
Section: Word Extractionmentioning
confidence: 99%
“…The detection of text lines has been widely explored in historical manuscript text books [26,9] and other historical documents of different natures, such as newspapers [25], meteorological tables [1] finding aids [33], as well as many other supports. With index tables, one can consider the issue as a two-class image segmentation task: we separate text lines from the background.…”
Section: Document Image Analysismentioning
confidence: 99%
“…The baseline definition was modified slightly towards manuscripts written in Arabic scripts. Mechi et al [28] and Neche et al [29] used an U-net and RU-net deep-learning models, which are variants of FCN. The models are trained for X-height based pixel-wise classifications of text lines.…”
Section: Related Workmentioning
confidence: 99%