2019 International Conference on Document Analysis and Recognition (ICDAR)
DOI: 10.1109/icdar.2019.00-16
End-to-End Measure for Text Recognition

Abstract: Measuring the performance of text recognition and text line detection engines is an important step to objectively compare systems and their configuration. There exist well-established measures for both tasks separately. However, there is no sophisticated evaluation scheme to measure the quality of a combined text line detection and text recognition system. The F-measure on word level is a well-known methodology, which is sometimes used in this context. Nevertheless, it does not take into account the alignment o…
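As a point of reference for the word-level F-measure the abstract criticizes, here is a minimal sketch assuming a simple bag-of-words comparison between ground truth and recognition output; the function name and whitespace tokenization are illustrative assumptions, not the paper's actual method.

```python
# Hedged sketch: word-level F-measure via bag-of-words matching.
# Because words are counted regardless of position, reading order and
# alignment errors go unpenalized -- the shortcoming the abstract points out.
from collections import Counter

def word_level_f_measure(ground_truth: str, recognized: str) -> float:
    gt_words = Counter(ground_truth.split())
    rec_words = Counter(recognized.split())
    matched = sum((gt_words & rec_words).values())  # multiset intersection
    if matched == 0:
        return 0.0
    precision = matched / sum(rec_words.values())
    recall = matched / sum(gt_words.values())
    return 2 * precision * recall / (precision + recall)

# Swapped word order still scores the same as a correct transcription:
print(word_level_f_measure("the quick brown fox", "quick the brown fix"))  # 0.75
```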

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

Cited by 5 publications (8 citation statements). References 6 publications (15 reference statements).
“…The RO provided by reference transcripts and/or other layout GT annotations is generally only one among several possible RO annotations which would all be correct. Therefore, mixing RO and word recognition errors into a single assessment measure (as in [14,25]) does not seem the best idea for understanding which are the inner issues of an end-to-end full-page HTR system.…”
Section: The Reading Order Problem
confidence: 99%
“…Finally, both measures are somehow combined to obtain a single scalar figure which hopefully represents an "overall performance" metric [14]. In a similar vein, but explicitly devoted to HTR evaluation, the work presented in [25] goes deeper into the metric-combination idea, with a daunting mathematical formulation. However, this is an utterly theoretical work which does not provide any empirical evidence that would support the proposed formulation or methods in practice.…”
Section: Integrating Evaluation of WER and Reading Order Mismatch
confidence: 99%
“…CER is the inverted accuracy, defined as CER = (i + s + d) / n, where n is the total number of characters, i the minimal number of character insertions, s the substitutions, and d the deletions required to transform the reference text into the OCR output. [16] propose an "end-to-end measure" which is based on the CER, but with alignment between GT and OCR results in a way that makes it configurable whether differences in the reading order or the over-/under-segmentation of text lines are penalized.…”
Section: State of the Art
confidence: 99%
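The CER formula quoted above, CER = (i + s + d) / n, can be computed with a standard Levenshtein edit-distance dynamic program. The following is a minimal sketch under that assumption; the function and variable names are illustrative and not taken from the cited works.

```python
# Hedged sketch: character error rate as (insertions + substitutions + deletions) / n,
# where n is the length of the reference text, via Levenshtein distance.
def character_error_rate(reference: str, hypothesis: str) -> float:
    n, m = len(reference), len(hypothesis)
    # dp[i][j] = minimal edits to turn reference[:i] into hypothesis[:j]
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        dp[i][0] = i                      # deletions only
    for j in range(m + 1):
        dp[0][j] = j                      # insertions only
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if reference[i - 1] == hypothesis[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,         # deletion
                           dp[i][j - 1] + 1,         # insertion
                           dp[i - 1][j - 1] + cost)  # substitution or match
    return dp[n][m] / n if n else 0.0

print(character_error_rate("recognition", "recogmtion"))  # 2 edits / 11 chars ~= 0.18
```

Note that a plain CER computed on concatenated page text implicitly assumes a fixed reading order, which is exactly what the configurable alignment in [16] is meant to relax.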
“…The difficulties underlying the evaluation of page-level HTR results boil down to a Reading Order (RO) problem [7,26,30,33]. A number of recent proposals try to heuristically weight and combine both word recognition and LA geometric errors into a single scalar value [10,19]. Unfortunately, this hinders the capability to sort out the nature of the corresponding errors, and thereby to make a comprehensive, useful assessment.…”
Section: Introduction
confidence: 99%