2007
DOI: 10.1117/12.705094
|View full text |Cite
|
Sign up to set email alerts
|

Document image content inventories

Abstract: We report an investigation into strategies, algorithms, and software tools for document image content extraction and inventory, that is, the location and measurement of regions containing handwriting, machine-printed text, photographs, blank space, etc. We have developed automatically trainable methods, adaptable to many kinds of documents represented as bilevel, greylevel, or color images, that offer a wide range of useful tradeoffs of speed versus accuracy using methods for exact and approximate k-Nearest Ne… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
23
0

Year Published

2008
2008
2019
2019

Publication Types

Select...
5
1
1

Relationship

3
4

Authors

Journals

citations
Cited by 20 publications
(23 citation statements)
references
References 8 publications
0
23
0
Order By: Relevance
“…Detail description of these features can be found in [3]. The error rate for this set is 13.6% and is shown in Fig.…”
Section: Methodsmentioning
confidence: 99%
“…Detail description of these features can be found in [3]. The error rate for this set is 13.6% and is shown in Fig.…”
Section: Methodsmentioning
confidence: 99%
“…Our classifier, discussed in [4,6,7], is an approximation of kNearest Neighbors and is used to classify each pixel in a document image by assigning it a class label, such as machine print, handwriting, photograph, etc. Features are extracted from each training sample (pixel) from a small, local window of no more than 20 pixels wide.…”
Section: Our Ground Truth Policymentioning
confidence: 99%
“…In previous work [4,5,9,10, 1], we have described a research program investigating versatile algorithms for document image content extraction, that is locating regions containing machine printed text, handwriting, photographs, etc. This program seeks to solve this problem in full generality, handling a vast variety of document and image types.…”
Section: Introductionmentioning
confidence: 99%
“…The bottom-up strategy, on the other hand, performs classification on small, naturally given parts of a document e.g. pixels, connected components, or individual strokes in online documents [2,8,18]. A clustering algorithm may follow to group small entities into larger, meaningful segments.…”
Section: Introductionmentioning
confidence: 99%
“…Top-down methods are prevailing if the document structure can be analyzed rather easily [9] (as in scientific papers or newspapers, for example). Pixel classification is preferred where the structure is difficult to recognize [2] (as in magazines where text and images may be mixed rather irregularly). In the field of online handwritten document analysis, the distinction of text and non-text is accomplished with a bottom-up approach in [8,13] where single strokes, as the smallest entities, are classified.…”
Section: Introductionmentioning
confidence: 99%