2013
DOI: 10.1016/j.procs.2013.05.056
|View full text |Cite
|
Sign up to set email alerts
|

A New Approach to Detect and Extract Characters from Off-Line Printed Images and Text

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(8 citation statements)
references
References 6 publications
0
8
0
Order By: Relevance
“…Text zones were determined by examining vertical lines by rules, white run pixels were used for segmentation and bounding box coordinates for each related component are used to compute the height of a certain character. Choudhary, Rishi & Ahlawat [16] proposed a method to segment character images from text containing images and good results are achieved which shows the strength of the proposed character detection and extraction technique.…”
Section: B Preprocessing 1) Binarizationmentioning
confidence: 96%
“…Text zones were determined by examining vertical lines by rules, white run pixels were used for segmentation and bounding box coordinates for each related component are used to compute the height of a certain character. Choudhary, Rishi & Ahlawat [16] proposed a method to segment character images from text containing images and good results are achieved which shows the strength of the proposed character detection and extraction technique.…”
Section: B Preprocessing 1) Binarizationmentioning
confidence: 96%
“…It calculates the average grey value for each pixel column then split every blank region in the middle, making it vulnerable to disconnected structure and touching characters. Recently, improved methods have been proposed but are only specific for single language [1,2,5,6,13,16,17,19,23]. Other researches exploit complex processing pipelines and hand-crafted rules to tackle multilingual cases [4,10,24,25].…”
Section: Related Workmentioning
confidence: 99%
“…One major reason for poor recognition accuracy in OCR system is the error in character segmentation. Some previous researches [1,2,5,6,13,16,17,19,23] achieve high performance on monolingual texts, but rely on feature engineering specific to single character style. Other researches [4,10,24,25] work on multilingual cases but introduce complex processing pipelines.…”
Section: Introductionmentioning
confidence: 99%
“…A pruning process is proposed with some knowledge about characters as well as size constrains proposed in [9]. Amit Choudharya et al [10 ] proposed an approach contains several steps. These steps could be summarized as: noise removal (image enhancement), thresholding to turn to binary image in an inverted form, labeling process to segment character images, connected foreground object repeatedly extracted, insignificant and large objects dropped, and finally the remaining labels inverted and considered potential objects.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, it gained the attention of the researchers" community. The features presented by the community include [6][7][8][9][10][11][12][13][14][15][16] Classifiers, according to [17], are built based on similarity, probability or decision boundaries. Similarity classifiers use similarity metrics to measure the closeness to the class members or the class preset representative(s).…”
Section: Introductionmentioning
confidence: 99%