2015
DOI: 10.1016/j.neucom.2014.08.076
|View full text |Cite
|
Sign up to set email alerts
|

Structure detection and segmentation of documents using 2D stochastic context-free grammars

Abstract: In this paper we define a bidimensional extension of Stochastic Context-Free Grammars for structure detection and segmentation of images of documents. Two sets of text classification features are used to perform an initial classification of each zone of the page. Then, the document segmentation is obtained as the most likely hypothesis according to a stochastic grammar. We used a dataset of historical marriage license books to validate this approach. We also tested several inference algorithms for Probabilisti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
21
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 8 publications
(21 citation statements)
references
References 32 publications
0
21
0
Order By: Relevance
“…We tested the proposed approach on a benchmark dataset proposed by [5] for addressing the segmentation of historical handwritten documents. As previously discussed, one significant problem to train CNNs is the limited number of labeled samples in the dataset and in other collections available for research.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…We tested the proposed approach on a benchmark dataset proposed by [5] for addressing the segmentation of historical handwritten documents. As previously discussed, one significant problem to train CNNs is the limited number of labeled samples in the dataset and in other collections available for research.…”
Section: Methodsmentioning
confidence: 99%
“…We therefore generated a synthetic dataset adopting the approach described in Section 2 obtaining a synthetic training set with 81,060 pages with associated information on the number of records in each page. In particular, the training set contains pages with a number of records comprised between 3 and 9 even if the benchmark collection in [5] contains only pages with 5,6, or 7 records each. When generating the training set we used the real images only to infer the structure of the pages and the overall structure of the records as well as to extract the page background.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations