2020
DOI: 10.1109/access.2020.3001605
|View full text |Cite
|
Sign up to set email alerts
|

Cursive Character Recognition in Natural Scene Images Using a Multilevel Convolutional Neural Network Fusion

Abstract: The accuracy of current natural scene text recognition algorithms is limited by the poor performance of character recognition methods for these images. The complex backgrounds, variations in the writing, text size, orientations, low resolution and multi-language text make recognition of text in natural images a complex and challenging task. Conventional machine learning and deep learning-based methods have been developed that have achieved satisfactory results, but character recognition for cursive text such a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 28 publications
(19 citation statements)
references
References 77 publications
0
18
0
1
Order By: Relevance
“…In order to verify the effectiveness of each component of the proposed method, we propose two variants based on the ETM method: single-ETM and TM. The single-ETM method 3 https://www.openml.org/ uses a single-objective optimization framework, which only considers the accuracy performance of the model to be optimized. The other settings are consistent with the ETM method.…”
Section: ) Comparison Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In order to verify the effectiveness of each component of the proposed method, we propose two variants based on the ETM method: single-ETM and TM. The single-ETM method 3 https://www.openml.org/ uses a single-objective optimization framework, which only considers the accuracy performance of the model to be optimized. The other settings are consistent with the ETM method.…”
Section: ) Comparison Methodsmentioning
confidence: 99%
“…So far, machine learning and deep learning has made great progress in many works on the image recognition field [1]- [3]. However, machine learning and deep learning still need many tedious processes in practical applications.…”
Section: Introductionmentioning
confidence: 99%
“…They have been adapted for feature extraction in many text recognition systems. We can cite for example: scene text recognition [12], [13], video text recognition [14], and offline handwriting text recognition [15]- [17]. However, CNN-based or DL-based approaches are still deficient.…”
Section: ) Feature Extractionmentioning
confidence: 99%
“…The output feature map Z can be regarded as the set of all channel feature maps Z k . Finally, calculate the activation value V k of each channel, where Z k (i, j ) is the pixel activation value of the feature map of channel k, as shown in Equation (5).…”
Section: Channel Quantization and Deep Neural Networkmentioning
confidence: 99%
“…IET Image Processing published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology recognition may suffer from complex backgrounds, uneven lighting, low contrast, blurry texts, text directions, font colors, writing styles, and mixed languages. Hence, character recognition in natural scenes has become a major study focus in this field [4][5][6]. In recent years, with the widespread attention to the bag-of-words (BOW) [7], many methods have been proposed to segment a character into small image "words", such as the end of a stroke, a curved stroke, or a cross stroke, and these small words can successfully increase the recognition rate.…”
Section: Introductionmentioning
confidence: 99%