Sequence-To-Sequence Domain Adaptation Network for Robust Text Image Recognition

Zhang, Yaping; Nie, Shuai; Liu, Wenju; Xu, Xing; Zhang, Dongxiang; Shen, Heng Tao

doi:10.1109/cvpr.2019.00285

Cited by 129 publications

(49 citation statements)

References 31 publications

(31 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…On the other hand, Gao et al [31] proposed an end-to-end fully convolutional network with the stacked convolutional layers to capture the long-term dependencies among elements of scene text image. Besides, Zhang et al [32] develops a sequence-tosequence Domain Adaptation Network (SSDAN) that introduces a gated attention similarity unit to align the distribution of the source and target sequence data. Bartz et al [33] proposed a semi-supervised neural network for simultaneously scene text detection and recognition.…”

Section: A Methods Towards Scene Text Recognitionmentioning

confidence: 99%

See 1 more Smart Citation

PGC-Net: A Light Weight Convolutional Sequence Network for Digital Pressure Gauge Calibration

Lian

et al. 2019

IEEE Access

View full text Add to dashboard Cite

Automatic digital pressure gauge calibration is challenging due to various unconstrained conditions. Although existing CNN-RNN based methods have been almost perfect on scene text recognition, they fail to perform well on digital pressure gauge calibration that requires to be extremely computationefficient and accurate. In this paper, we propose a light weight fully convolutional sequence recognition network for fast and accurate digital Pressure Gauge Calibration (PGC-Net). PGC-Net integrates feature extraction, sequence modelling and transcription into a unified framework. Experimental results show that PGC-Net runs 28 fps on CPU with 97.41% accuracy. Compared with previous methods, PGC-Net achieves better or comparable performance at lower inference time. Without bells and whistles, PGC-Net is capable of recognizing decimal points that usually appear in pressure gauge images, which evidently verifies the feasibility of PGC-Net. We collected a dataset that contains 17, 240 gauge images with annotated labels for automatic digital pressure gauge calibration. The dataset has been public for future research.

show abstract

Section: A Methods Towards Scene Text Recognitionmentioning

confidence: 99%

“…Bartz et al [33] proposed a semi-supervised neural network for simultaneously scene text detection and recognition. However, the method of [26], [32]- [35] adopted deep convolutional backbone to extract image feature and are too time-consuming for AGC.…”

Section: A Methods Towards Scene Text Recognitionmentioning

confidence: 99%

PGC-Net: A Light Weight Convolutional Sequence Network for Digital Pressure Gauge Calibration

Lian

et al. 2019

IEEE Access

View full text Add to dashboard Cite

show abstract

“…Inspired by CTC, (Bai et al 2018) proposed a "Edit Probability" to optimize the training process, as missing or superfluity of characters may mislead CTC training. (Zhang et al 2019) also introduced a domain adaption method to varying length text recognition. The major approach for recent regular text recognition methods is still CTC-based, which enforces the alignment between feature sequence and labels.…”

Section: Related Workmentioning

confidence: 99%

GTC: Guided Training of CTC towards Efficient and Accurate Scene Text Recognition

Cai²,

Hou³

et al. 2020

AAAI

103

View full text Add to dashboard Cite

Connectionist Temporal Classification (CTC) and attention mechanism are two main approaches used in recent scene text recognition works. Compared with attention-based methods, CTC decoder has a much shorter inference time, yet a lower accuracy. To design an efficient and effective model, we propose the guided training of CTC (GTC), where CTC model learns a better alignment and feature representations from a more powerful attentional guidance. With the benefit of guided training, CTC model achieves robust and accurate prediction for both regular and irregular scene text while maintaining a fast inference speed. Moreover, to further leverage the potential of CTC decoder, a graph convolutional network (GCN) is proposed to learn the local correlations of extracted features. Extensive experiments on standard benchmarks demonstrate that our end-to-end model achieves a new state-of-the-art for regular and irregular scene text recognition and needs 6 times shorter inference time than attention-based methods.

show abstract

“…Chen et al [165] proposed a multinomial adversarial network (MAN) to address the text recognition problem by using the adversarial approach. In [166, 167], the encoder–decoder models are introduced for text recognition problem. Zhan et al [168] presented geometry‐aware domain adaptation network (GA‐DAN), which models the shift between domains in both geometry and appearance spaces, and converts images with different characteristics across domains.…”

Section: Unsupervised Domain Adaptation For Other Applicationsmentioning

confidence: 99%

Deep visual unsupervised domain adaptation for classification tasks: a survey

et al. 2020

View full text Add to dashboard Cite

Sequence-To-Sequence Domain Adaptation Network for Robust Text Image Recognition

Cited by 129 publications

References 31 publications

PGC-Net: A Light Weight Convolutional Sequence Network for Digital Pressure Gauge Calibration

PGC-Net: A Light Weight Convolutional Sequence Network for Digital Pressure Gauge Calibration

GTC: Guided Training of CTC towards Efficient and Accurate Scene Text Recognition

Deep visual unsupervised domain adaptation for classification tasks: a survey

Contact Info

Product

Resources

About