International Conference on Advances in Pattern Recognition 1999
DOI: 10.1007/978-1-4471-0833-7_30

Categorizing Document Images into Script and Language Classes

Cited by 12 publications (10 citation statements)
References 4 publications
“…However, all sample images in corpus three are coupled with slight skew distortion with skew angle controlled under 20 degree. Unlike some methods [7,8] that require document restoration first, word images are transformed to WSCs directly based on our proposed word shape coding scheme. Experiment results show 157 text image are correctly identified with average identification rate reaching over 97%.…”
Section: Language Identification (mentioning)
confidence: 99%
“…Lastly, most reported language identification techniques [5,7,8] cannot identify language in document images that contain just a few words. We therefore construct the fourth corpus to evaluate the performance of our proposed technique with respect to word number.…”
Section: Language Identification (mentioning)
confidence: 99%
“…A number uses character-based features or connected component analysis [2], [3]. The paradox inherent in such an approach is that it is sometimes necessary to know the script of the document in order to extract such components.…”
Section: Introduction (mentioning)
confidence: 99%