2020
DOI: 10.1609/aaai.v34i07.6864
|View full text |Cite
|
Sign up to set email alerts
|

Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting

Abstract: Many approaches have recently been proposed to detect irregular scene text and achieved promising results. However, their localization results may not well satisfy the following text recognition part mainly because of two reasons: 1) recognizing arbitrary shaped text is still a challenging task, and 2) prevalent non-trainable pipeline strategies between text detection and text recognition will lead to suboptimal performances. To handle this incompatibility problem, in this paper we propose an end-to-end traina… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
41
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 79 publications
(60 citation statements)
references
References 36 publications
0
41
0
Order By: Relevance
“…A. Spatial-Temporal Video Text Detector 1) Text Detection in Single Frame: Since TP [17] is a more robust text spotter than EAST [16] especially on irregular text detection, and also can be trained end-to-end. We redesign and implement the video text detector inspired by the TP architecture (including a text detection module, a shape transform module and a recognition module), as shown in Figure 2.…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…A. Spatial-Temporal Video Text Detector 1) Text Detection in Single Frame: Since TP [17] is a more robust text spotter than EAST [16] especially on irregular text detection, and also can be trained end-to-end. We redesign and implement the video text detector inspired by the TP architecture (including a text detection module, a shape transform module and a recognition module), as shown in Figure 2.…”
Section: Methodsmentioning
confidence: 99%
“…With the rapid development of artificial intelligence techniques [18], [19], [20], [21], great progress has been made in many isolated applications such as causal inference [22], named entities identification [23], question answering [24], scene text spotting [5], [6], [17] and video understanding [25], [26]. However, it is very important to build multiple knowledge representation [27] for understanding the real and complex world.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations