2013
DOI: 10.1117/12.2004788
|View full text |Cite
|
Sign up to set email alerts
|

Automated recognition and extraction of tabular fields for the indexing of census records

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2013
2013
2023
2023

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(8 citation statements)
references
References 3 publications
0
8
0
Order By: Relevance
“…Document segmentation into logical snippets, such as text lines or table cells, is often used as a preprocessing step for text recognition systems [26]. For forms, which are defined by a rigid structure of fields, accurate segmentation can be acheived via alignment with a form template which encodes the relative spacing of the segmentation cuts for that form's particular structure or type [7,29]. Such form templates can even be created automatically by simultaneous registration of many forms of the same type [19].…”
Section: Introductionmentioning
confidence: 99%
“…Document segmentation into logical snippets, such as text lines or table cells, is often used as a preprocessing step for text recognition systems [26]. For forms, which are defined by a rigid structure of fields, accurate segmentation can be acheived via alignment with a form template which encodes the relative spacing of the segmentation cuts for that form's particular structure or type [7,29]. Such form templates can even be created automatically by simultaneous registration of many forms of the same type [19].…”
Section: Introductionmentioning
confidence: 99%
“…Anything that can be done to reduce the size of n will reduce the compute time. Second, we have shown in previous work 10 that handwriting recognition accuracy drops significantly when comparing between different handwriting styles and out-ofvocabulary words. The main ramification of the decision to break the collection into groups is that learning in one group does not transfer to the next group.…”
Section: Precomputing Morphing Costsmentioning
confidence: 99%
“…10 Also, the Fourier-Mellin transform has been shown to be effective in determining the transform parameters to rectify a document image with strong delineated structure. 11 Consensus based techniques can help in discovering cell boundaries.…”
Section: Document Preprocessingmentioning
confidence: 99%
See 1 more Smart Citation
“…Making use of the pre-printed table substrate, they use Hough transform to detect the horizontal and vertical rulings that constitute the tabular structure. Clawson et al present a projection-profile based method to detect and extract handwritten tabular fields from historical census forms [5]. We notice that these existing techniques are evaluated on datasets where rulings are usually salient and well displaced, meaning no other lines will distract table analysis.…”
Section: Introductionmentioning
confidence: 99%