2019
DOI: 10.1145/3231737
|View full text |Cite
|
Sign up to set email alerts
|

Convolutional Attention Networks for Scene Text Recognition

Abstract: In this article, we present Convoluitional Attention Networks (CAN) for unconstrained scene text recognition. Recent dominant approaches for scene text recognition are mainly based on Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN), where the CNN encodes images and the RNN generates character sequences. Our CAN is different from these methods; our CAN is completely built on CNN and includes an attention mechanism. The distinctive characteristics of our method include (i) CAN follows enc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
19
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 62 publications
(21 citation statements)
references
References 27 publications
0
19
0
Order By: Relevance
“…However, robustness to distortion and generality to variant language are challenging for these systems. To explore the advancement in TIE techniques, [57] and as encoder in attention mechanism outperformed others [56]. Although, these techniques are showing promising results, but diversity in data sources makes the system complex [55].…”
Section: Text Recognitionmentioning
confidence: 99%
See 1 more Smart Citation
“…However, robustness to distortion and generality to variant language are challenging for these systems. To explore the advancement in TIE techniques, [57] and as encoder in attention mechanism outperformed others [56]. Although, these techniques are showing promising results, but diversity in data sources makes the system complex [55].…”
Section: Text Recognitionmentioning
confidence: 99%
“…CNN based OCR have also shown pretty good results but the performance of technique on unstructured big datasets is still to be investigated. The attention mechanism is a new approach in text recognition [54,56]. Initially, the results are satisfactory but there is a huge room for improvement in terms of unstructured and multidimensional big data.…”
Section: Text Recognitionmentioning
confidence: 99%
“…SURF [54] and ROI based visual codebook are respectively applied for adult image detection [40]. Some other approaches [45][46][47][48][49][50][51][52][53] are worthy to be noticed.…”
Section: Related Workmentioning
confidence: 99%
“…Inspired by the recent successes of convolutional neural networks (CNNs) [12]- [15] in high level computer vision tasks, deep neural networks (DNNs) emerged in addressing low level computer vision tasks as well [16]- [23]. For the task of arXiv:1812.10836v3 [cs.CV] 17 Nov 2019 image inpainting, Pathak et al [21] presented an auto-encoder to perform context-based image inpainting.…”
Section: Introductionmentioning
confidence: 99%