2022
DOI: 10.1007/978-3-031-19815-1_15
|View full text |Cite
|
Sign up to set email alerts
|

GLASS: Global to Local Attention for Scene-Text Spotting

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(8 citation statements)
references
References 41 publications
0
8
0
Order By: Relevance
“…On Total-Text, we surpass all current state-of-the-art in both settings. Some of these prior arts [21,49,67] fine-tune their models on Total-Text which boosts the performance on this target dataset at the cost of dropping performance on others. Also note that, some prior arts [21,22,26,44] limit recognition to case-insensitive letters and no punctuation symbols, while ours operate in a case-sensitive mode, a more difficult but more important one.…”
Section: Comparison With State-of-the-art Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…On Total-Text, we surpass all current state-of-the-art in both settings. Some of these prior arts [21,49,67] fine-tune their models on Total-Text which boosts the performance on this target dataset at the cost of dropping performance on others. Also note that, some prior arts [21,22,26,44] limit recognition to case-insensitive letters and no punctuation symbols, while ours operate in a case-sensitive mode, a more difficult but more important one.…”
Section: Comparison With State-of-the-art Resultsmentioning
confidence: 99%
“…Text detection stage produces bounding polygons or rotated bounding boxes for text instances at one granularity, usually words. Text instances are cropped from input image pixels [4], encoded backbone features [26,45], or both [49]. The text recognition stage decodes the text transcription.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…While there are many advanced OCR technologies that one can apply [7,8,9,10,11], we aim at carrying out all AI computation on-device for better privacy, connection and latency. To build a prototype quickly, our first system is modularized to the following three major components: word detection, word recognition, grouping and ordering.…”
Section: Baseline Ocr Systemmentioning
confidence: 99%