2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019
DOI: 10.1109/iccv.2019.00917

TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting

Cited by 176 publications (79 citation statements)
References 29 publications
“…In this section, we compare our network with the state-of-the-art approaches [1], [3], [10], [11], [16], [18], [20], [21], [23], [24], [27]-[29], [43], [45], [49], [65], [66], [69], [69], [71]-[73] on six different benchmark datasets. We consider recall, precision, and f-measure as the metrics for evaluation of accuracy of detection.…”
Section: Comparison With State-of-the-art Results
Citation type: mentioning
confidence: 99%
“…Mask TextSpotter [11] uses semantic segmentation to detect text of arbitrary shapes and spatial attention for handling text instances of irregular shapes by simultaneously considering local and global textual information. TextDragon [73] describes the shape of text with a sequence of quadrangles to handle the text of arbitrary shapes and RoISlide that connect a deep network and connectionist temporal classification based text recognizer. The labeling of locations of characters is not needed.…”
Section: Scene Text Spotting
Citation type: mentioning
confidence: 99%
“…It can be seen from Table II that we have also achieved very good performance on the CTW1500 data set. Especially in the text detection task, it is 1.1% higher than [30].…”
Section: Curved Text
Citation type: mentioning
confidence: 93%
“…Recently, in order to sufficiently exploit the complementarity between detection and recognition, many methods [45], [4], [5], [6], [46], [7], [17], [47], [37], [48], [49], [50] are proposed to spot text in an end-to-end manner, which utilize the recognition information to optimize the localization task.…”
Section: A. Text Reading In Single Images
Citation type: mentioning
confidence: 99%
“…In fact, it is a very challenging task to optimize video text spotter end-to-end when taking multiple functional modules (text detection, text tracking and text recognition) into consideration, especially compared to the traditional four-staged pipeline strategy. Therefore, in this paper we develop an end-to-end trainable video text spotter with only two trainable modules: the video text detector and the text recommender, similar to the end-to-end text spotting methods [6], [17], [45], [47], [48], [49] in single images.…”
Section: B. Text Reading In Videos
Citation type: mentioning
confidence: 99%