2009 10th International Conference on Document Analysis and Recognition 2009
DOI: 10.1109/icdar.2009.237
|View full text |Cite
|
Sign up to set email alerts
|

Markov Random Field Based Text Identification from Annotated Machine Printed Documents

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2010
2010
2022
2022

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 26 publications
(11 citation statements)
references
References 13 publications
0
11
0
Order By: Relevance
“…Training on handwritten CCs and improved segmentation of touching characters and annotations (Fig. 8) would also likely improve precision/recall to match or exceed levels reported in [14] while competing with error rates reported in state-of-the-art approaches [18].…”
Section: Discussionmentioning
confidence: 79%
See 1 more Smart Citation
“…Training on handwritten CCs and improved segmentation of touching characters and annotations (Fig. 8) would also likely improve precision/recall to match or exceed levels reported in [14] while competing with error rates reported in state-of-the-art approaches [18].…”
Section: Discussionmentioning
confidence: 79%
“…A SVM is used to isolate signatures from machine print in [12] and to identify sparse handwritten annotations occurring at arbitrary orientations in [13]. Kmeans clustering followed by MRF relabeling is used in [14] for segmentation of handwriting, machine print and noise with an overall recall of 96.33%. In [18] a novel approach to automatically discover features pushes error rates of handwriting and machine print to 13.8%.…”
Section: Introductionmentioning
confidence: 99%
“…Patches that were generated as a single unit but contained both machine printed text and handwriting (including crosses, edit marks, and other annotations) within a single patch as shown in Fig.1 after using the dilation operation described in [18,14] on the HP Labs data set were divided into training and testing sets.…”
Section: Methodsmentioning
confidence: 99%
“…In [18], Zheng et al proposed a two step approach to identify three different types of patches (machine printed text, handwriting and noise) in mixed documents. Peng et al [14] used a Markov Random Field (MRF) based method to relabel three types of patches after initial classification. By projecting each word horizontally, Guo and Ma separated handwritten material from documents using a hidden Markov model [9].…”
Section: Introductionmentioning
confidence: 99%
“…Recent work in pixel-level labeling would seem to be applicable 7-9 because fragmentation of foreground markings into units smaller than connected components is essential to this task, while much academic work has stopped at the connected component level. 10,11 The classic segmentation/recognition dilemma of mainstream computer vision is operative here: given correctly segmented image layers, OCR and graphics recognition would be much more tractable, and conversely, if the text and graphical shapes were known in advance, it would be relatively easier to discover their poses and supporting pixel-level evidence in the image.…”
Section: Data Capture / Functional Role Labelingmentioning
confidence: 99%