2018 13th IAPR International Workshop on Document Analysis Systems (DAS) 2018
DOI: 10.1109/das.2018.44
|View full text |Cite
|
Sign up to set email alerts
|

Comparing Machine Learning Approaches for Table Recognition in Historical Register Books

Abstract: We present in this paper experiments on Table Recognition in hand-written registry books. We first explain how the problem of row and column detection is modelled, and then compare two Machine Learning approaches (Conditional Random Field and Graph Convolutional Network) for detecting these table elements. Evaluation was conducted on death records provided by the Archive of the Diocese of Passau. Both methods show similar results, a 89 F1 score, a quality which allows for Information Extraction. Software and d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
21
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 21 publications
(21 citation statements)
references
References 14 publications
0
21
0
Order By: Relevance
“…The table rows are detected and evaluated in a second step (see Section III-B). The dataset will be published as part of a competition dataset on table recognition planned in 2018 within the context of the READ project 5 . The GT is represented as extended PAGE XML (see https://github.com/ Transkribus/TranskribusPageformat).…”
Section: Datasetmentioning
confidence: 99%
See 3 more Smart Citations
“…The table rows are detected and evaluated in a second step (see Section III-B). The dataset will be published as part of a competition dataset on table recognition planned in 2018 within the context of the READ project 5 . The GT is represented as extended PAGE XML (see https://github.com/ Transkribus/TranskribusPageformat).…”
Section: Datasetmentioning
confidence: 99%
“…In this section, the table structure matching using association graphs is presented. As a second part, the work of Clinchant et al [5] is shortly summarized which allows for row detection based on detected columns of the presented table matching and detected baselines.…”
Section: Table Recognitionmentioning
confidence: 99%
See 2 more Smart Citations
“…Passau Diocesan Archives are also working to improve the automated Layout Analysis of tabular data by sorting tables from their collection into different categories that can be used as training templates. Improved table recognition and the possibility of exporting tabular data will have significant implications for the field, since many archival documents are laid out in tables and forms (Clinchant et al, 2018). The National Archives of Finland [27] would like to enhance the usability of their vast collections of governmental records, many of which are digitised but not transcribed.…”
mentioning
confidence: 99%