2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00619
|View full text |Cite
|
Sign up to set email alerts
|

Rotation-Sensitive Regression for Oriented Scene Text Detection

Abstract: Text in natural images is of arbitrary orientations, requiring detection in terms of oriented bounding boxes. Normally, a multi-oriented text detector often involves two key tasks: 1) text presence detection, which is a classification problem disregarding text orientation; 2) oriented bounding box regression, which concerns about text orientation. Previous methods rely on shared features for both tasks, resulting in degraded performance due to the incompatibility of the two tasks. To address this issue, we pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
224
0
2

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
3
2

Relationship

0
10

Authors

Journals

citations
Cited by 460 publications
(226 citation statements)
references
References 43 publications
(99 reference statements)
0
224
0
2
Order By: Relevance
“…7 (c)(d)). Large character spacing is an unresolved problem which also exists in other state-of-the-art methods such as RRD [28]. For symbol detection and false positives, PAN is trained on small datasets (about 1000 images) and we believe this problem will be alleviated when increasing training data.…”
Section: Failure Samplesmentioning
confidence: 99%
“…7 (c)(d)). Large character spacing is an unresolved problem which also exists in other state-of-the-art methods such as RRD [28]. For symbol detection and false positives, PAN is trained on small datasets (about 1000 images) and we believe this problem will be alleviated when increasing training data.…”
Section: Failure Samplesmentioning
confidence: 99%
“…Note that 'MS' denotes multi-scale testing of the trained models. Compared with the previous methods, e.g., RRD [24] and Border [41] , our baseline text reading models ('End2End' and 'End2End-MS') show slightly better performance in F-score for detection, which is tested in single and multiple scales, respectively. Note that the end-to-end baseline of ICDAR 2017-RCTW [36] marked with + used a large synthetic dataset with a Chinese Figure 6: Matching examples generated by OPM module.…”
Section: Comparisons With Other Approachesmentioning
confidence: 81%
“…Even the multi-scale, our method runs at a speed of 10.5 fps. Compared with recent methods [21,17,23,4], our method is comparable with accuracy and efficiency.…”
Section: Comparison To State Of the Artmentioning
confidence: 85%