2022
DOI: 10.1109/tpami.2021.3122572
|View full text |Cite
|
Sign up to set email alerts
|

Content and Style Aware Generation of Text-Line Images for Handwriting Recognition

Abstract: Handwritten Text Recognition has achieved an impressive performance in public benchmarks. However, due to the high inter-and intra-class variability between handwriting styles, such recognizers need to be trained using huge volumes of manually labeled training data. To alleviate this labor-consuming problem, synthetic data produced with TrueType fonts has been often used in the training loop to gain volume and augment the handwriting style variability. However, there is a significant style bias between synthet… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 28 publications
(10 citation statements)
references
References 40 publications
(47 reference statements)
0
9
0
Order By: Relevance
“…To evaluate their readability, the Character Error Rate (CER) is also adopted. Efforts towards the definition of a task-specific score for HTG have recently brought to the vFID score [8] and the Handwriting Distance (HWD) score [54]. In this work, we consider a number of evaluation scores and settings to favor the adoption of our proposed evaluation protocol and to provide a meaningful comparison between existing HTG approaches.…”
Section: Related Workmentioning
confidence: 99%
“…To evaluate their readability, the Character Error Rate (CER) is also adopted. Efforts towards the definition of a task-specific score for HTG have recently brought to the vFID score [8] and the Handwriting Distance (HWD) score [54]. In this work, we consider a number of evaluation scores and settings to favor the adoption of our proposed evaluation protocol and to provide a meaningful comparison between existing HTG approaches.…”
Section: Related Workmentioning
confidence: 99%
“…Afterwards, the cGAN [34] allows controlling the class of generated contents by inputting an additional class condition. The latest techniques [3], [35]- [40] focus on line-level style-guided text image generation. These systems synthesized glyph-realistic text line images with different writer styles, usually optimized by discriminators, writer classifiers, and character recognizers.…”
Section: Text Image Generationmentioning
confidence: 99%
“…S TYLE-GUIDED text image generation is a challenging task, which tries to synthesize text images by imitating reference style image's appearance while keeping text content unaltered (e.g. [1]- [3]). The appearance of a text image includes foreground and background color patterns, spatial transformations and deformations, typography (for printed text), writing styles and stroke thickness (for handwriting text), etc [2].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Continuous speech recognition requires an LM to be built into an efficient system. An LM is essential for many applications in natural language processing (NLP), including but not limited to handwriting recognition [ 1 , 2 , 3 , 4 ], machine translation [ 5 ], speech recognition [ 6 , 7 ], integral model [ 8 ], phonemes [ 9 ], and vowels [ 10 ].…”
Section: Introductionmentioning
confidence: 99%