2017 IEEE International Conference on Computer Vision (ICCV) 2017
DOI: 10.1109/iccv.2017.87
|View full text |Cite
|
Sign up to set email alerts
|

Deep Direct Regression for Multi-oriented Scene Text Detection

Abstract: In this paper, we first provide a new perspective to divide existing high performance object detection methods into direct and indirect regressions. Direct regression performs boundary regression by predicting the offsets from a given point, while indirect regression predicts the offsets from some bounding box proposals. Then we analyze the drawbacks of the indirect regression, which the recent state-of-the-art detection structures like Faster-RCNN and SSD follows, for multi-oriented scene text detection, and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
200
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
4
3
3

Relationship

1
9

Authors

Journals

citations
Cited by 372 publications
(200 citation statements)
references
References 27 publications
0
200
0
Order By: Relevance
“…[69] and [64] propose methods which first detect text segments and then link them into text instances by spatial relationship or link predictions. Zhou et al [82] and He et al [27] regress text boxes directly from dense segmentation maps. Lyu et al [51] propose to detect and group the corner points of the text to generate text boxes.…”
Section: Scene Text Detectionmentioning
confidence: 99%
“…[69] and [64] propose methods which first detect text segments and then link them into text instances by spatial relationship or link predictions. Zhou et al [82] and He et al [27] regress text boxes directly from dense segmentation maps. Lyu et al [51] propose to detect and group the corner points of the text to generate text boxes.…”
Section: Scene Text Detectionmentioning
confidence: 99%
“…The aspect ratio of text lines varies greatly, and limited anchors cannot cover the size or aspect ratio of all objects; thus, many methods are anchor-free. Both [4] and [1] generate labels with shrunk segmentation maps, and regress the vertices or angles of the bounding box on positive pixels. [29] generates a corner map and a position-sensitive segmentation map, calculates oriented bounding boxes based on the corner map, and calculates the score for each bounding box using the position-sensitive segmentation map.…”
Section: B Oriented Objects Detectionmentioning
confidence: 99%
“…For bounding box regression based methods, they can be divided into one-stage methods and two-stage methods. One-stage methods including Deep Direct Regression [5], TextBox [12], TextBoxes++ [11], DMPNet [16], SegLink [24] and EAST [34], directly estimate bounding boxes of text regions in one step. Two-stage methods in-clude R2CNN [8], RRD [13], RRPN [22], IncepText [28] and FEN [31].…”
Section: Related Workmentioning
confidence: 99%