2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00436
|View full text |Cite
|
Sign up to set email alerts
|

Learning Shape-Aware Embedding for Scene Text Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
101
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 215 publications
(108 citation statements)
references
References 37 publications
0
101
0
Order By: Relevance
“…In this section, we compare our network with the stateof-the-art approaches [1], [3], [10], [11], [16], [18], [20], [21], [23], [24], [27]- [29], [43], [45], [49], [65], [66], [69], [69], [71]- [73] on six different benchmark datasets. We consider recall, precision, and f-measure as the metrics for evaluation of accuracy of detection.…”
Section: Comparison With State-of-the-art Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…In this section, we compare our network with the stateof-the-art approaches [1], [3], [10], [11], [16], [18], [20], [21], [23], [24], [27]- [29], [43], [45], [49], [65], [66], [69], [69], [71]- [73] on six different benchmark datasets. We consider recall, precision, and f-measure as the metrics for evaluation of accuracy of detection.…”
Section: Comparison With State-of-the-art Resultsmentioning
confidence: 99%
“…They use oriented region proposal network and oriented region-of-interest pooling layer to map arbitrary-oriented region proposals to a feature tensor for text classification. Tian et al project pixels onto an embedding space, where they consider pixels of same text instances appear closer to each other [23]. The authors in [24] incorporate normalization of scale and orientation of text instances to map to a desired canonical geometry range.…”
Section: A Scene Text Detectionmentioning
confidence: 99%
“…Ext 85.3 67.9 75.6 Wang et al [9] 80.2 80.1 80.1 Tian et al [13] 77.8 82.7 80.1 PSENet [12] 79.7 84.8 82.2 DB [12] 80.2 86.9 83.4 Ours 80.3 84.9 82.5 : The single-scale results on CTW1500. "R", "P" and "F" represent the recall, precision, and F-measure respectively.…”
Section: Methodsmentioning
confidence: 99%
“…However, it has many output kernels which may have negative effects on location results. [13] adopts a mirror symmetry of FPN [14] to produce embedding features and text foreground masks, and uses cluster processing to detect texts. DB [15] proposes a Differentiable Binarization module to predict the shrunk regions, and the shrunk regions are dilated with an constant expanding ratio.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation