2023
DOI: 10.3390/s23208610
|View full text |Cite
|
Sign up to set email alerts
|

Scene Uyghur Recognition Based on Visual Prediction Enhancement

Yaqi Liu,
Fanjie Kong,
Miaomiao Xu
et al.

Abstract: Aiming at the problems of Uyghur oblique deformation, character adhesion and character similarity in scene images, this paper proposes a scene Uyghur recognition model with enhanced visual prediction. First, the content-aware correction network TPS++ is used to perform feature-level correction for skewed text. Then, ABINet is used as the basic recognition network, and the U-Net structure in the vision model is improved to aggregate horizontal features, suppress multiple activation phenomena, better describe th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 42 publications
0
1
0
Order By: Relevance
“…Scene Text Recognition Methods Employing Joint Learning of Internal Language Models: Unlike external guided language models, internal language models typically participate in the entire model training process and learn contextual semantic information during training. Methods based on RNN [1,38,[46][47][48][49][50] capture the temporal information of characters in the text through RNN, thereby understanding the dependency relationships between characters in the text. The TRBA [1] model adopts BiLSTM for sequence modeling, aiming to better comprehend the contextual information between characters.…”
Section: Related Workmentioning
confidence: 99%
“…Scene Text Recognition Methods Employing Joint Learning of Internal Language Models: Unlike external guided language models, internal language models typically participate in the entire model training process and learn contextual semantic information during training. Methods based on RNN [1,38,[46][47][48][49][50] capture the temporal information of characters in the text through RNN, thereby understanding the dependency relationships between characters in the text. The TRBA [1] model adopts BiLSTM for sequence modeling, aiming to better comprehend the contextual information between characters.…”
Section: Related Workmentioning
confidence: 99%