Structure detection and segmentation of documents using 2D stochastic context-free grammars

Alvaro, Francisco; Cruz, Francisco; Sánchez, Joan-Andreu; Terrades, Oriol Ramos; Benedí, José-Miguel

doi:10.1016/j.neucom.2014.08.076

Cited by 8 publications

(21 citation statements)

References 32 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We tested the proposed approach on a benchmark dataset proposed by [5] for addressing the segmentation of historical handwritten documents. As previously discussed, one significant problem to train CNNs is the limited number of labeled samples in the dataset and in other collections available for research.…”

Section: Methodsmentioning

confidence: 99%

“…We therefore generated a synthetic dataset adopting the approach described in Section 2 obtaining a synthetic training set with 81,060 pages with associated information on the number of records in each page. In particular, the training set contains pages with a number of records comprised between 3 and 9 even if the benchmark collection in [5] contains only pages with 5,6, or 7 records each. When generating the training set we used the real images only to infer the structure of the pages and the overall structure of the records as well as to extract the page background.…”

Section: Methodsmentioning

confidence: 99%

“…In the first one, we used stratified cross validation to estimate the error rate on a larger dataset. In the second experiment we compared the results achievable by the proposed approach with those described in [5] considering the same splitting of the data in training, validation, and test datasets.…”

Section: Methodsmentioning

confidence: 99%

“…In the second experiment, we compared the results obtained by our system with those presented in [5]. In order to perform a fair comparison we used the splitting of training and test data as proposed in [5].…”

Section: Benchmark Splitmentioning

confidence: 99%

“…The record detection problem has been addressed in [5], where an EM-based layout analysis method is proposed. The approach is tested on a collection of marriage license books where each page contains a variable number of handwritten records.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Deep neural networks for record counting in historical handwritten documents

Capobianco

Marinai

2019

Pattern Recognition Letters

View full text Add to dashboard Cite

Abstract-In this paper, we investigate the use of Convolutional Neural Networks for counting the number of records in historical handwritten documents. With this work we demonstrate that training the networks only with synthetic images allows us to perform a near perfect evaluation of the number of records printed on historical documents. The experiments have been performed on a benchmark dataset composed by marriage records and outperform previous results on this dataset.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Methodsmentioning

confidence: 99%

Section: Methodsmentioning

confidence: 99%

Section: Benchmark Splitmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Deep neural networks for record counting in historical handwritten documents

Capobianco

Marinai

2019

Pattern Recognition Letters

View full text Add to dashboard Cite

show abstract

Layout Analysis of PDF Documents by Two-Dimensional Grammars for the Production of Accessible Textbooks

Kohase

Nakamura

Fujiyoshi

2020

Lecture Notes in Computer Science

View full text Add to dashboard Cite

This paper proposes the use of two-dimensional context-free grammars (2DCFGs) for layout analysis of PDF documents. In Japan, audio textbooks have been available for students with print disabilities in compulsory education. In order to create accessible textbooks including audio textbooks, it is necessary to obtain the information of structure and the reading order of documents of regular textbooks in PDF. It is not simple task because most PDF files only have the information how to print them out, and page-layouts of most textbooks are complex. By using 2DCFGs, we could obtain useful information of regular textbooks in PDF for the production of accessible textbooks.

show abstract

Complexity of Two-Dimensional Rank-Reducing Grammars

Průša

2020

Descriptional Complexity of Formal Systems

View full text Add to dashboard Cite

Structure detection and segmentation of documents using 2D stochastic context-free grammars

Cited by 8 publications

References 32 publications

Deep neural networks for record counting in historical handwritten documents

Deep neural networks for record counting in historical handwritten documents

Layout Analysis of PDF Documents by Two-Dimensional Grammars for the Production of Accessible Textbooks

Complexity of Two-Dimensional Rank-Reducing Grammars

Contact Info

Product

Resources

About