2006 10th IEEE International Enterprise Distributed Object Computing Conference Workshops (EDOCW'06) 2006
DOI: 10.1109/edocw.2006.29
|View full text |Cite
|
Sign up to set email alerts
|

Document Layout Analysis and Classification and Its Application in OCR

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
5
0

Year Published

2008
2008
2024
2024

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 7 publications
0
5
0
Order By: Relevance
“…For example, text in a scanned document may not be easily searchable or editable without additional processing such as optical character recognition (OCR), which converts the image-based text into machine-readable text (Gupta, 2006;Hsu et al, 2022).…”
Section: Literature Reviewmentioning
confidence: 99%
“…For example, text in a scanned document may not be easily searchable or editable without additional processing such as optical character recognition (OCR), which converts the image-based text into machine-readable text (Gupta, 2006;Hsu et al, 2022).…”
Section: Literature Reviewmentioning
confidence: 99%
“…Many methods for understanding document images have been investigated to extract and classify meaningful information from documents. Classical methods of layout analysis [1] involve performing morphological operations, connected component analysis, and then classifying extracted features into their constituent regions. Recent advances in highly precise and robust deep neural networks have led to the exploration of deep learning-based layout analysis as well.…”
Section: Introductionmentioning
confidence: 99%
“…[25] performs horizontal dilation and the length of the line is fixed so that the lines can be extracted completely and heuristic rules are applied for text and non-text region classification. Region growing and analysis with heuristic rule is also applied where the generic modeling of paper layout is known [26]. Connected component generation and heuristics are also applied in [27].…”
Section: Introductionmentioning
confidence: 99%