2019
DOI: 10.1109/access.2019.2908933
|View full text |Cite
|
Sign up to set email alerts
|

FTPN: Scene Text Detection With Feature Pyramid Based Text Proposal Network

Abstract: Scene text detection is to detect the position of a text in the natural scene, the quality of which will directly affect the subsequent text recognition. It plays an important role in fields such as image retrieval and autopilot. How to perform multi-scale and multi-oriented text detection in the scene still remains as a problem. This paper proposes an effective scene text detection method that combines the convolutional neural network (CNN) and recurrent neural network (RNN). In order to better adapt to texts… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
13
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 31 publications
(16 citation statements)
references
References 35 publications
0
13
0
Order By: Relevance
“…On ICDAR 2013, a dataset from natural scene images with presence of horizontal or near horizontal text, Pelee-Text++ reached competitive results with F-measure of 79.72% and 85.78% for its 768 and multi-scale versions, defeating Pelee-Text with a model 13 Megabytes lighter (see Table 7). Despite having a lower F-measure compared to state-of-theart methods, such as CRAFT [1], MaskTextSpotter [38], PMTD [32] and FTPN [31]; our proposal obtained a good trade-off between efficacy and model size. Figure 4a shows the balance between effectiveness and model size compared to state-of-the-art methods.…”
Section: A Detecting English Textmentioning
confidence: 92%
See 3 more Smart Citations
“…On ICDAR 2013, a dataset from natural scene images with presence of horizontal or near horizontal text, Pelee-Text++ reached competitive results with F-measure of 79.72% and 85.78% for its 768 and multi-scale versions, defeating Pelee-Text with a model 13 Megabytes lighter (see Table 7). Despite having a lower F-measure compared to state-of-theart methods, such as CRAFT [1], MaskTextSpotter [38], PMTD [32] and FTPN [31]; our proposal obtained a good trade-off between efficacy and model size. Figure 4a shows the balance between effectiveness and model size compared to state-of-the-art methods.…”
Section: A Detecting English Textmentioning
confidence: 92%
“…The proposed approaches have used different techniques for dealing with scene text detection, most of them are focused on regression [6,28,31,51,57], segmentation [4,8,56,56], or the combination of both techniques [35,38,39,52,63]. Additionally, some proposals have merged the detection and recognition tasks as a jointed pipeline for improving their results [34,38], or had even used knowledge distillation [68].…”
Section: Related Work a Scene Text Detectionmentioning
confidence: 99%
See 2 more Smart Citations
“…The construction of the FPN aims to extract high-resolution and segmentation features by combining the output of the BU and TD pathways, but it takes a long time and consumes memory. At the same time, the development of computational processor devices like the graphics processing units (GPUs) have contributed to the improvement and development of image classification and recognition by introducing effective methods, like the fully convolutional network (FCN) [37], residual network (ResNet) and squeeze and excitation (SENet) [20,21,[37][38][39].…”
Section: Introductionmentioning
confidence: 99%