2008 the Eighth IAPR International Workshop on Document Analysis Systems 2008
DOI: 10.1109/das.2008.73
|View full text |Cite
|
Sign up to set email alerts
|

A Complete Optical Character Recognition Methodology for Historical Documents

Abstract: In this paper a complete OCR

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
32
0
2

Year Published

2009
2009
2019
2019

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 54 publications
(35 citation statements)
references
References 18 publications
0
32
0
2
Order By: Relevance
“…The first was the test set of the ICDAR 2007 Handwriting segmentation competition [1] while the second was a set of Greek historical typewritten documents [2].…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The first was the test set of the ICDAR 2007 Handwriting segmentation competition [1] while the second was a set of Greek historical typewritten documents [2].…”
Section: Resultsmentioning
confidence: 99%
“…The proposed evaluation framework has been applied to several state-of-the-art techniques using two different document image sets. The two sets comprise a) the test set of the ICDAR2007 handwriting segmentation competition [1] and b) a set of Greek historical typewritten documents [2].…”
Section: Introductionmentioning
confidence: 99%
“…Kohonen algorithm that is one of Artificial neural network The experiments also demonstrated that system complexity can be reduced significantly without degrading performance by considering two-layered neural network rather than multiple layered neural networks [14]. In this paper [15] a complete OCR methodology for recognizing historical documents, either printed or handwritten without any knowledge of the font, is presented. The pre-processing and segmentation approach is used in order to detect text lines, words, and characters.…”
Section: Methodsmentioning
confidence: 99%
“…In most cases, historical document recognition systems produce a recognition result that is evaluated in terms of character accuracy at the levels of 90% -95% [15,19]. One of the reasons for this is the fact that several errors are introduced during the segmentation phase of historical documents.…”
Section: Introductionmentioning
confidence: 99%
“…In [4] an open-source programming framework is introduced for building systems that extract information from digitized historical documents empowering the document experts themselves to develop systems with reduced effort. In [19], a complete OCR methodology for recognizing historical documents, either printed or handwritten without any knowledge of the font, is presented. It consists of a pre-processing step, a top-down segmentation step as well as a clustering scheme in order to group characters of similar shape.…”
Section: Introductionmentioning
confidence: 99%