2012
DOI: 10.1007/978-3-642-29364-1_12
|View full text |Cite
|
Sign up to set email alerts
|

NEOCR: A Configurable Dataset for Natural Image Text Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
17
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 36 publications
(17 citation statements)
references
References 11 publications
0
17
0
Order By: Relevance
“…There are more than 100 types of frequently-used languages all over the world, but a majority of the existing methods and benchmarks (except for [12,27,[79][80][81]) have focused on texts in English. In this age of globalization, it is urgent and indispensable to build systems that are able to handle multilingual texts and serve the people in the whole world.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…There are more than 100 types of frequently-used languages all over the world, but a majority of the existing methods and benchmarks (except for [12,27,[79][80][81]) have focused on texts in English. In this age of globalization, it is urgent and indispensable to build systems that are able to handle multilingual texts and serve the people in the whole world.…”
Section: Discussionmentioning
confidence: 99%
“…• NEOCR The NEOCR dataset 10) [79] includes images with multioriented texts in natural scenes. It contains 659 real world images with 5 238 annotated bounding boxes.…”
Section: Benchmark Datasets • Icdar 2003 and 2005mentioning
confidence: 99%
“…The NEOCR dataset [33] contains 659 natural scene images with multi-oriented texts of high variability (see Figure 2c for examples). This database is intended for scene text recognition and provided multilingual evaluation environments, as it includes texts in eight European languages.…”
Section: Literature Reviewmentioning
confidence: 99%
“…For those tests, we used the "Word recognition" dataset from the ICDAR 2003 Competition [14] and the NEOCR dataset [15], using HCP or CPGS, the image was segmented and then for each segment the Tesseract OCR engine [16] was used to recognize the character.…”
Section: Characters Recognitionmentioning
confidence: 99%