Linjie Deng scite author profile

To achieve high coverage of target boxes, conventional one-stage anchor-based detectors typically place multiple priors at each spatial position, especially in scene text detection tasks. In this work, we present a simple and intuitive method for multi-oriented text detection in which each location of the feature maps is associated with only one reference box. The idea is inspired by the two-stage R-CNN framework, which can estimate the location of objects of any shape by using learned proposals. Our method integrates this mechanism into a one-stage detector and uses the learned anchor, obtained through a regression operation, in place of the original one in the final predictions. Based on RetinaNet, our method achieves competitive performance on several public benchmarks with real-time efficiency (26.5 fps at 800p), which surpasses all anchor-based scene text detectors. In addition, because it requires less attention to anchor design, we believe our method can be easily applied to other analogous detection tasks. The code will be publicly available at https://github.com/xhzdeng/stela.


Detecting multi-oriented text with corner-based region proposals

Deng, Gong, Lin et al. 2019, Neurocomputing


Previous approaches to scene text detection usually rely on manually defined sliding windows. This work presents an intuitive two-stage region-based method that detects multi-oriented text without any prior knowledge of the textual shape. In the first stage, we estimate the possible locations of text instances by detecting and linking corners instead of shifting a set of default anchors. The quadrilateral proposals are geometry adaptive, which allows our method to cope with various text aspect ratios and orientations. In the second stage, we design a new pooling layer named Dual-RoI Pooling, which embeds data augmentation inside the region-wise subnetwork for more robust classification and regression over these proposals. Experimental results on public benchmarks confirm that the proposed method achieves performance comparable to state-of-the-art methods. The code is publicly available at https://github.com/xhzdeng/crpn.


Unified Chinese License Plate detection and recognition with high efficiency

Gong, Deng, Tao et al. 2022, Journal of Visual Communication and Image Representation


Generating Text Sequence Images for Recognition

Gong, Deng, Ma et al. 2020, Neural Process Lett


Recently, methods based on deep learning have dominated the field of text recognition. With a large amount of training data, most of them can achieve state-of-the-art performance. However, it is hard to collect and label sufficient text sequence images from real scenes. To mitigate this issue, several methods for synthesizing text sequence images have been proposed, yet they usually need complicated preceding or follow-up steps. In this work, we present a method that can generate unlimited training data without any auxiliary pre- or post-processing. We treat the generation task as an image-to-image translation problem and use conditional adversarial networks to produce realistic text sequence images from semantic ones. We assess our method with several evaluation metrics, and the results demonstrate that the quality of the generated data is satisfactory. The code and dataset will be publicly available soon.


Comparative proteome analysis of amniotic fluids and placentas from patients with idiopathic polyhydramnios

Cen, Wei et al. 2020, Placenta


Focus-Enhanced Scene Text Recognition with Deformable Convolutions

Deng, Gong, Lu et al. 2019


Recently, scene text recognition methods based on deep learning have proliferated in computer vision. Existing methods achieve strong performance, but recognizing irregular text remains challenging because of its varied shapes and distorted patterns. When reading words in the real world, we normally do not rectify them in our minds but instead adjust our focus and visual field. Similarly, by using deformable convolutional layers whose geometric structures are adjustable, we present an enhanced recognition network that handles irregular text without a rectification step. Experiments on public benchmarks demonstrate the effectiveness of our proposed components and show that our method reaches satisfactory performance. The code will be publicly available at https://github.com/Alpaca07/dtr soon.


Unattached irregular scene text rectification with refined objective

et al. 2021



Linjie Deng

A Real-Time ATC Safety Monitoring Framework Using a Deep Learning Approach

STELA: A Real-Time Scene Text Detector With Learned Anchor
