2020
DOI: 10.1515/bfp-2020-0024
|View full text |Cite
|
Sign up to set email alerts
|

OCR-D kompakt: Ergebnisse und Stand der Forschung in der Förderinitiative

Abstract: ZusammenfassungBereits seit einigen Jahren werden große Anstrengungen unternommen, um die im deutschen Sprachraum erschienenen Drucke des 16.–18. Jahrhunderts zu erfassen und zu digitalisieren. Deren Volltexttransformation konzeptionell und technisch vorzubereiten, ist das übergeordnete Ziel des DFG-Projekts OCR-D, das sich mit der Weiterentwicklung von Verfahren der Optical Character Recognition befasst. Der Beitrag beschreibt den aktuellen Entwicklungsstand der OCR-D-Software und analysiert deren erste Tests… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 6 publications
0
2
0
Order By: Relevance
“…DocExtractor’s architecture relies on an encoder-decoder (namely a modified U-Net [ 43 ] with a ResNet-18 [ 44 ] encoder) for pixel-wise segmentation. We trained this “out-of-the-box” network on our data using the recommended hyper-parameters ( , acccessed on 2 October 2022) and used it to benchmark our YOLO model, as it has specifically been proposed for processing historical documents and because its architecture is commonly used in state-of-the-art OCR systems [ 45 ] to segment pages and extract text regions, outperforming Mask-RCNN [ 6 ] as shown in [ 8 ].…”
Section: Detecting Visual Elements In the Sphaera ...mentioning
confidence: 99%
“…DocExtractor’s architecture relies on an encoder-decoder (namely a modified U-Net [ 43 ] with a ResNet-18 [ 44 ] encoder) for pixel-wise segmentation. We trained this “out-of-the-box” network on our data using the recommended hyper-parameters ( , acccessed on 2 October 2022) and used it to benchmark our YOLO model, as it has specifically been proposed for processing historical documents and because its architecture is commonly used in state-of-the-art OCR systems [ 45 ] to segment pages and extract text regions, outperforming Mask-RCNN [ 6 ] as shown in [ 8 ].…”
Section: Detecting Visual Elements In the Sphaera ...mentioning
confidence: 99%
“…It implements an iterative workflow that allows for rapidly training very accurate OCR models for specific publications or publication series. Lastly, OCR-D [5,6] is a workfloworiented, modular platform integrating several OCR engines into a common architecture. Unlike OCR4all, OCR-D was developed for a technical audience, such as staff working in the digitization units of cultural heritage institutions.…”
Section: Related Workmentioning
confidence: 99%