2013 12th International Conference on Document Analysis and Recognition
DOI: 10.1109/icdar.2013.110

On Combining Multiple Segmentations in Scene Text Recognition

Abstract: An end-to-end real-time scene text localization and recognition method is presented. The three main novel features are: (i) keeping multiple segmentations of each character until the very last stage of the processing when the context of each character in a text line is known, (ii) an efficient algorithm for selection of character segmentations minimizing a global criterion, and (iii) showing that, despite using theoretically scale-invariant methods, operating on a coarse Gaussian scale space pyramid yi…

Cited by 86 publications (31 citation statements)
References: 14 publications
“…Although many approaches (e.g., [1], [2], [3]) have been proposed, this problem remains largely unsolved, e.g., the winning team in ICDAR-2013 "Reading Text in Scene Images" competition achieved only a localization recall of about 66% [4]. The difficulties mainly come from diversities of texts (e.g., languages, font, size, color, orientation, noise, illumination, low contrast, occlusion and so on) as well as the complexity of the backgrounds [5].…”
Section: Introduction (mentioning)
confidence: 99%
“…Existing text detection methods can be categorized into three groups: sliding window based methods (e.g., [6], [7], [8]), connected component (CC) based methods (e.g., [1], [2], [5], [9]) and hybrid methods (e.g., [3], [10]). Among them, the extremal-region (ER) based methods, which belong to the connected component based methods, won the first places in both ICDAR-2011 and ICDAR-2013 competitions ([11], [4]).…”
Section: Introduction (mentioning)
confidence: 99%
“…In end-to-end method [2] individual characters were detected as Extremal Regions. The regions were first agglomerated into text lines by an efficient pruned exhaustive search that estimates the text direction on each triplet of regions and the constraints induced by the text direction contribute to the similarity measure used for clustering.…”
Section: Existing Text Recognition Methods (mentioning)
confidence: 99%
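
The citation statement above mentions estimating the text direction on each triplet of regions. The following is a minimal illustrative sketch of that single idea, not the method of the cited paper: it assumes each region is reduced to a centroid, fits a direction to every triplet from the principal axis of the three points, and keeps triplets whose points lie close to the fitted line. The function names `triplet_direction` and `collinear_triplets` and the threshold `max_dev` are hypothetical, and the pruning and clustering steps described in the statement are not reproduced here.

```python
from itertools import combinations
import math


def triplet_direction(p, q, r):
    """Fit a line to three centroids.

    Returns (angle, deviation): the direction of the principal axis in
    radians, and the largest perpendicular distance of a point to that line.
    """
    pts = [p, q, r]
    cx = sum(x for x, _ in pts) / 3
    cy = sum(y for _, y in pts) / 3
    # Second-order central moments of the three points.
    sxx = sum((x - cx) ** 2 for x, _ in pts)
    syy = sum((y - cy) ** 2 for _, y in pts)
    sxy = sum((x - cx) * (y - cy) for x, y in pts)
    # Orientation of the principal axis of the 2x2 scatter matrix.
    angle = 0.5 * math.atan2(2 * sxy, sxx - syy)
    # Unit normal to the fitted line, used for perpendicular distances.
    nx, ny = -math.sin(angle), math.cos(angle)
    deviation = max(abs((x - cx) * nx + (y - cy) * ny) for x, y in pts)
    return angle, deviation


def collinear_triplets(centroids, max_dev=2.0):
    """Exhaustively test all triplets; keep those that are nearly collinear.

    max_dev is an assumed tolerance in pixels, not a value from the paper.
    """
    kept = []
    for i, j, k in combinations(range(len(centroids)), 3):
        angle, dev = triplet_direction(centroids[i], centroids[j], centroids[k])
        if dev <= max_dev:
            kept.append(((i, j, k), angle))
    return kept


if __name__ == "__main__":
    # Three centroids on a roughly horizontal line plus one outlier.
    centroids = [(0, 0), (10, 0.5), (20, 0), (10, 40)]
    for indices, angle in collinear_triplets(centroids):
        print(indices, round(math.degrees(angle), 2))
```

Testing every triplet costs O(n³) in the number of regions, which is why a practical system would need some form of pruning, as the statement notes.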
“…Maximally Stable Extremal Regions (MSERs) [22] have been used widely for scene text detection [25,11,29] and segmentation [25,36,26]. A classifier is trained to separate text from background based on the shape of each MSER region, along with other hand-drafted features.…”
Section: Related Work (mentioning)
confidence: 99%