Form similarity via Levenshtein distance between ortho-filtered logarithmic ruling-gap ratios

Nagy, George; Lopresti, Daniel

doi:10.1117/12.2041956

Cited by 3 publications

(4 citation statements)

References 18 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Form images have been represented in a large variety of ways for classification tasks (see [5] for a survey). These representations include statistics of image connected components [37], BoVW [6,24,38], OCR features [2,32,33], pyramids of average gray-scale values [18,32], Viola-Jones features [34], Hidden Tree Markov Models [10], sequences of line segments [12,20], sequence of line gap ratios [28], run length histograms [16], Shape Context Features [22], and most recently learned features from Convolutional Neural Networks [17,21,38].…”

Section: Form Image Classificationmentioning

confidence: 99%

“…In [12,20,28], forms are represented as sequences of vertical and horizontal rule lines, which are compared using a similarity metric such as edit distance or clique finding in an association graph. While these methods discretize or ignore the position or length of lines, CONFIRM performs a novel edit distance directly on a continuous representation of line segments, making it more robust to line detection errors.…”

Section: Form Image Classificationmentioning

confidence: 99%

“…While form recognition is a well-studied problem (e.g. [5,10,16,20,28,32,37]), fewer works, have considered the task of form clustering. The goal in this case is to partition the collection by form types (not known apriori ), which we define to be the exact layout structure of the form encompassing preprinted lines, text, and figures.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

CONFIRM – Clustering of noisy form images using robust matching

Tensmeyer

Martinez

2019

Pattern Recognition

View full text Add to dashboard Cite

Identifying the type of a scanned form greatly facilitates processing, including automated field segmentation and field recognition. Contrary to the majority of existing techniques, we focus on unsupervised type identification, where the set of form types are not known apriori, and on noisy collections that contain very similar document types. This work presents a novel algorithm: CONFIRM (Clustering Of Noisy Form Images using Robust Matching), which simultaneously discovers the types in a collection of forms and assigns each form to a type. CONFIRM matches type-set text and rule lines between forms to create domain specific features, which we show outperform Bag of Visual Word (BoVW) features employed by the current state-of-the-art. To scale to large document collections, we use a bootstrap approach to clustering, where only a small subset of the data is clustered directly, while the rest of the data is assigned to clusters in linear time. We show that CONFIRM reduces average cluster impurity by 44% compared to the state-of-the art on 5 collections of historical forms that contain significant noise. We also show competitive performance on the relatively clean NIST tax form collection.

show abstract

Section: Form Image Classificationmentioning

confidence: 99%

Section: Form Image Classificationmentioning

confidence: 99%

See 1 more Smart Citation

CONFIRM – Clustering of noisy form images using robust matching

Tensmeyer

Martinez

2019

Pattern Recognition

View full text Add to dashboard Cite

show abstract

“…Preliminary results on classification of some degraded forms were presented at the 2014 SPIE Conference on Document Recognition and Retrieval [38].…”

Section: Prior Workmentioning

confidence: 99%

On Parallel Lines in Noisy Forms

Nagy

2014

Lecture Notes in Computer Science

Self Cite

View full text Add to dashboard Cite

Abstract. Quantification of the rectilinear configuration of typeset rules (lines) opens the way to form classification and content extraction. Line detection on scanned forms is often accomplished with the Hough transform. Here it is followed by simultaneous extraction of the dominant perpendicular sets of extracted lines, which ensures rotation invariance. Translation and scale invariance are attained by using minimal horizontal and vertical sets of distance ratios ("rule gap ratios") instead of rule-edge locations. The ratios are logarithmically mapped to an alphabet so that the resulting symbol strings can be classified by. edit distance. Some probability distributions associated with these steps are derived. Analytical considerations and small-scale experiments on scanned forms suggest that this approach has potential merit for highly degraded forms.

show abstract

Invariant representation for rectilinear rulings

Nagy

2014

J. Electron. Imaging

View full text Add to dashboard Cite

Form similarity via Levenshtein distance between ortho-filtered logarithmic ruling-gap ratios

Cited by 3 publications

References 18 publications

CONFIRM – Clustering of noisy form images using robust matching

CONFIRM – Clustering of noisy form images using robust matching

On Parallel Lines in Noisy Forms

Invariant representation for rectilinear rulings

Contact Info

Product

Resources

About