2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops) 2011
DOI: 10.1109/iccvw.2011.6130247
|View full text |Cite
|
Sign up to set email alerts
|

Text detection and recognition in urban scenes

Abstract: Text detection and recognition in real images taken in unconstrained environments, such as street view images, remain surprisingly challenging in Computer Vision.In this paper, we present a comprehensive strategy combining bottom-up and top-down mechanisms to detect Text boxes. The bottom-up part is based on character segmentation and grouping . The top-down part is achieved with a statistical learning approach based on box descriptors. Our main contribution consists in introducing a new descriptor, Fuzzy HOG … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0
2

Year Published

2013
2013
2021
2021

Publication Types

Select...
4
4
2

Relationship

0
10

Authors

Journals

citations
Cited by 28 publications
(18 citation statements)
references
References 12 publications
(30 reference statements)
0
16
0
2
Order By: Relevance
“…Overlay text detection schemes can be categorized as either patch based or geometrical property based [1]. Patch based techniques extract features from image patches and identify text regions using pre-trained classifiers [2]. These patches are grouped further to detect the text regions.…”
Section: Introductionmentioning
confidence: 99%
“…Overlay text detection schemes can be categorized as either patch based or geometrical property based [1]. Patch based techniques extract features from image patches and identify text regions using pre-trained classifiers [2]. These patches are grouped further to detect the text regions.…”
Section: Introductionmentioning
confidence: 99%
“…Edge-based methods are effective when the background is relatively plain; however, in real-world scenes, strong edges are very common so extra effort should be devoted to reducing false positives. For example, inspired by face and pedestrian detectors [23], Histogram of Oriented Gradients (HOG) based approaches include edge orientation or spatial information, and perform multi-scale character detection via sliding window classification [2]- [4]. However, due to the large search space, at present HOG-based approaches are too computationally expensive for real-time implementation.…”
Section: A Environmental Text Spottingmentioning
confidence: 99%
“…The motivating application for text classifiers such as T-HOG and R-HOG is the detection of text in photos and videos of arbitrary scenes [29,30]. Specifically, the idea is to use the classifier to filter the output of a fast but "permissive" (high-recall and moderate-precision) detector.…”
Section: T-hog As a Post-filter To Text Detectionmentioning
confidence: 99%