2022
DOI: 10.48550/arxiv.2211.04963
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Pure Transformer with Integrated Experts for Scene Text Recognition

Yew Lee Tan,
Adams Wai-kin Kong,
Jung-Jae Kim

Abstract: Scene text recognition (STR) involves the task of reading text in cropped images of natural scenes. Conventional models in STR employ convolutional neural network (CNN) followed by recurrent neural network in an encoder-decoder framework. In recent times, the transformer architecture is being widely adopted in STR as it shows strong capability in capturing long-term dependency which appears to be prominent in scene text images. Many researchers utilized transformer as part of a hybrid CNN-transformer encoder, … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 33 publications
(41 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?