2018
DOI: 10.48550/arxiv.1805.01167
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
20
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 18 publications
(20 citation statements)
references
References 16 publications
0
20
0
Order By: Relevance
“…Similarly, Huang et al [32] update feature extraction network derived from Pyramid Attention Network [33], and add a text mask prediction branch that detects curved texts. In addition, Yang et al [34] and Dai et al [35] use a fully convolutional instance-aware semantic segmentation (FCIS) [36] method to guide the prediction of three text-related elements: mask, class, and box, by generating an instance-aware segmentation perspective. Wang et al [37] propose an Adaptive-RPN with a scale-insensitive metric to accurately generate proposal bounding boxes, and then add contour characteristic of text regions by executing the convolution operation in two orthogonal directions to locate texts with arbitrary shapes.…”
Section: Combination Of Segmentation and Regression Methodsmentioning
confidence: 99%
“…Similarly, Huang et al [32] update feature extraction network derived from Pyramid Attention Network [33], and add a text mask prediction branch that detects curved texts. In addition, Yang et al [34] and Dai et al [35] use a fully convolutional instance-aware semantic segmentation (FCIS) [36] method to guide the prediction of three text-related elements: mask, class, and box, by generating an instance-aware segmentation perspective. Wang et al [37] propose an Adaptive-RPN with a scale-insensitive metric to accurately generate proposal bounding boxes, and then add contour characteristic of text regions by executing the convolution operation in two orthogonal directions to locate texts with arbitrary shapes.…”
Section: Combination Of Segmentation and Regression Methodsmentioning
confidence: 99%
“…Precision Recall F-measure SegLink [37] 73.10 76.80 75.00 SSTD [8] 80.00 73.00 77.00 WordSup [12] 79.33 77.03 78.16 EAST * [46] 83.27 78.33 80.72 R2CNN [17] 85.62 79.68 82.54 DDR [10] 82.00 80.00 81.00 Lyu et al * [31] 89.50 79.70 84.30 RRD * [24] 88.00 80.00 83.80 TextBoxes++ * [22] 87.80 78.50 82.90 PixelLink [3] 85.50 82.00 83.70 FOTS [26] 91.00 85.17 87.99 IncepText * [42] 89.40 84.30 86.80 TextSnake [29] 84.90 80.40 82.60 FTSN [2] 88.60 80.00 84.10 SPCNET [41] 88…”
Section: Methodsmentioning
confidence: 99%
“…SmoothL1 Loss. SmoothL1 loss function is one of the most common loss functions for the bounding box regression task, such as in [1,19,23,24,29,32,33], as defined below:…”
Section: Regression Loss Functionmentioning
confidence: 99%
“…For the expression of regression terms of RBox, two main types are categorized. Following the idea in [32], the bounding box regression was categorized into two branches, which are direct regression and indirect regression. The indirect regression method is derived from R-CNN, computing a set of offsets using ground truth and prior boxes, as expressed in Eq.…”
Section: Rbox Regression Parametersmentioning
confidence: 99%