2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC) 2021
DOI: 10.1109/smc52423.2021.9658799
|View full text |Cite
|
Sign up to set email alerts
|

Temporally-aware Convolutional Block Attention Module for Video Text Detection

Abstract: Scene-text spotting is a task that predicts a text area on natural scene images and recognizes its text characters simultaneously. It has attracted much attention in recent years due to its wide applications. Existing research has mainly focused on improving text region detection, not text recognition. Thus, while detection accuracy is improved, the end-to-end accuracy is insufficient. Texts in natural scene images tend to not be a random string of characters but a meaningful string of characters, a word. Ther… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(1 citation statement)
references
References 54 publications
(79 reference statements)
0
1
0
Order By: Relevance
“…The aim of text recognition, also known as optical character recognition (OCR), is to convert the text in images into digital text sequences. Many studies have been conducted on this technology owing to its wide range of real-world applications, including reading license plates and handwritten text, analyzing documents such as receipts and invoices [23,58], and analyzing road signs in automated driving and natural scenes [14,16]. However, the various fonts, lighting variations, complex backgrounds, low-quality images, occlusion, and text deformation make text recognition challenging.…”
Section: Introductionmentioning
confidence: 99%
“…The aim of text recognition, also known as optical character recognition (OCR), is to convert the text in images into digital text sequences. Many studies have been conducted on this technology owing to its wide range of real-world applications, including reading license plates and handwritten text, analyzing documents such as receipts and invoices [23,58], and analyzing road signs in automated driving and natural scenes [14,16]. However, the various fonts, lighting variations, complex backgrounds, low-quality images, occlusion, and text deformation make text recognition challenging.…”
Section: Introductionmentioning
confidence: 99%