2022
DOI: 10.3390/signals3030032
|View full text |Cite
|
Sign up to set email alerts
|

Text Line Extraction in Historical Documents Using Mask R-CNN

Abstract: Text line extraction is an essential preprocessing step in many handwritten document image analysis tasks. It includes detecting text lines in a document image and segmenting the regions of each detected line. Deep learning-based methods are frequently used for text line detection. However, only a limited number of methods tackle the problems of detection and segmentation together. This paper proposes a holistic method that applies Mask R-CNN for text line extraction. A Mask R-CNN model is trained to extract t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(10 citation statements)
references
References 29 publications
0
10
0
Order By: Relevance
“…In fact, the other study we know of, where Mask-RCNN was used for text line segmentation in historical documents with Latin-based or Arabic scripts [ 16 ], gives much more encouraging results. In this study, the network was trained to segment fragments of text lines into overlapping patches, which are then merged together to reconstruct the entire page with its multiple segmented text lines.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…In fact, the other study we know of, where Mask-RCNN was used for text line segmentation in historical documents with Latin-based or Arabic scripts [ 16 ], gives much more encouraging results. In this study, the network was trained to segment fragments of text lines into overlapping patches, which are then merged together to reconstruct the entire page with its multiple segmented text lines.…”
Section: Related Workmentioning
confidence: 99%
“…These encouraging results confirm our preliminary data showing that Mask-RCNN is a promising network for text line segmentation of historical documents, at least with Western scripts [ 30 , 31 ]. In parallel with [ 16 ], we actually developed another approach to exploit the Mask-RCNN architecture for the same purpose, while overcoming some of the weaknesses of their own approach. Indeed, the networks with which these authors challenged Mask-RCNN are not recent ones and, unlike U-Net networks, cannot be considered as state-of-the-art artificial neural networks.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…These models based Fully Convolutional Networks (FCN) are trained to detect the baseline or x-heights of text line which are not enough to extract a complete line. Recently (Droby et al, 2022) used mask R-CNN to extract lines of text from the entire page by combining the results from the segmentation of small patches of the page document. They reaches 87.96% of F-measure on 10 pages of the VML-AHTE dataset with post-processing that refines the extraction of lines of text.…”
Section: Text Line Extractionmentioning
confidence: 99%