1987
DOI: 10.1002/scj.4690180410
|View full text |Cite
|
Sign up to set email alerts
|

A method of document‐image segmentation based on projection profiles, stroke densities and circumscribed rectangles

Abstract: A method is proposed wherein printed documents are segmented into the following three areas: headline area, textline area, and attached area. Then the character lines constituting headline and textline are extracted. This paper describes: 1) the combination of the global features of the document such as projection profiles and stroke densities, and the local features of the document such as circumscribed rectangles are used; 2) the basic features of document elements such as character line periodicity in the t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

1991
1991
1998
1998

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 2 publications
0
5
0
Order By: Relevance
“…The conventional methods for detecting character strings are as follows: ( 1) a method using projection histograms (the histograms are calculated several times while changing the range and scanning direction), (5) (2) a method using the properties (e.g. size and ratio of height to width) of components of connected black pixels,(6) (3) a method using the properties of components of connected black pixels in a pre-processed (e.g.…”
Section: Character String Detection Algorithmmentioning
confidence: 99%
“…The conventional methods for detecting character strings are as follows: ( 1) a method using projection histograms (the histograms are calculated several times while changing the range and scanning direction), (5) (2) a method using the properties (e.g. size and ratio of height to width) of components of connected black pixels,(6) (3) a method using the properties of components of connected black pixels in a pre-processed (e.g.…”
Section: Character String Detection Algorithmmentioning
confidence: 99%
“…The preprocessing step shown in Fig. 3 performs noise reduction, correction of an inclined image [4], and so forth. The run length smoothing step decreases the number of connected components in the document image so that the connected component extraction and hierarchical segmentation processing are sped up.…”
Section: Segmentation Systemmentioning
confidence: 99%
“…Over the past few years, several approaches have been proposed: methods which use the structural or statistical properties of document images [l, 2, 5-81, methods which use knowledge processing techniques [ 101, interactive processing methods [3], and a hybrid method which uses both the properties of image pixels and knowledge processing techniques [4]. But problems remain with the generality or robustness of the algorithms.…”
Section: Introductionmentioning
confidence: 99%
“…In the method of creating the connected components of circumscribing rectangles or black pixels, it is pointed out that the incorrect extraction might occur if the contact existed between characters or between character and picture [6]. This problem will be overcome by considering the local orientation of text lines.…”
Section: Introductionmentioning
confidence: 98%