2011 International Conference on Document Analysis and Recognition 2011
DOI: 10.1109/icdar.2011.142

Word Retrieval in Historical Document Using Character-Primitives

Cited by 16 publications (9 citation statements)
References 9 publications
“…1. From a scanned original document, we use the Agora and Retro software developed by Ramel et al. [15][16][17] at the University of Tours, France. This step allows us to extract all instances of the characters used in the document, and group them together, in order to obtain sets of 'a', 'b' ...…”
Section: Process Description (mentioning)
confidence: 99%
“…Changes in the writing hand, skewness, and also changes in the baseline are some of the key problems. It has as well been observed that, in old manuscripts, the writers have not well distinguished between inter-word spaces and intra-word spaces (inter-letters, for example) [1]. This phenomenon seems to be universal for all the languages [2].…”
Section: Introduction (mentioning)
confidence: 99%
“…One direction is to use a primitive- or grapheme-based representation in order to create a grammar for the huge set of possible character shapes based on a limited set of graphemes [1]. However, graphemes require a subcharacter segmentation, and this segmentation could be an ill-posed problem because the ground-truth data is at the character level, not at the grapheme level.…”
Section: Introduction (mentioning)
confidence: 99%
“…For document images, the following databases are the most commonly used. For printed documents: Washington UW3 [2], LRDE [3], RETAS-OCR [4], PaRADIIT [5], etc.; for handwritten documents: IAM database [6], RIMES [7], GERMANA [8], etc.…”
Section: Introduction (mentioning)
confidence: 99%