2020
DOI: 10.1007/978-3-030-54956-5_17

Layout Detection and Table Recognition – Recent Challenges in Digitizing Historical Documents and Handwritten Tabular Data

Cited by 12 publications (3 citation statements)
References 23 publications
“…When we started this project, Transkribus was not able to extract the structure of the source image. Similar challenges were reported for other projects transcribing historical sources with tabular structure (Lehenmeier et al, 2020). However, Transkribus offers a good GUI solution for the manual labelling and exporting of images that is useful for professional transcribers (Vézina et al, 2019).…”
Section: Comparison To Related Work (mentioning)
confidence: 61%
“…A study for the layout analysis of historical newspapers has been conducted which achieved very good results e.g. [27]. One study explores visual trends in newspapers and models them as a multimodal construct consisting of text and images.…”
Section: Applications Of Image Processing In Digital Humanities (mentioning)
confidence: 99%
“…The extraction of different layout elements of articles is an important component of scientific data curation, with the accuracy of extraction of the elements such as tables, figures and their captions increasing significantly over the past several years [4,15,25,51]. A large field of study within document layout analysis is the "mining" of PDFs as newer PDFs are generally in "vector" format -the document is rendered from a set of instructions instead of pixel-by-pixel as in a raster format, and, in theory, the set of instructions can be parsed to determine the locations of figures, captions and tables [3,9,23].…”
Section: Introduction (mentioning)
confidence: 99%