2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) 2017
DOI: 10.1109/icdar.2017.46
|View full text |Cite
|
Sign up to set email alerts
|

CNN Based Page Object Detection in Document Images

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
24
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
6
1
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 57 publications
(29 citation statements)
references
References 11 publications
0
24
0
Order By: Relevance
“…This model is able to detect four different kinds of page objects: table, bar chart, pie chart and line chart. Xiaohan Yi et al [20] proposed a dynamic programming based region proposal method for page object detection.…”
Section: B Deep Learning Based Methodsmentioning
confidence: 99%
“…This model is able to detect four different kinds of page objects: table, bar chart, pie chart and line chart. Xiaohan Yi et al [20] proposed a dynamic programming based region proposal method for page object detection.…”
Section: B Deep Learning Based Methodsmentioning
confidence: 99%
“…Yi et al [10] presented a page object detection method using region proposal CNNs, followed by a custom algorithm to refine proposed regions, and a CNN classifier for object category classification. It first pre-processes the input image by applying a component-based region proposal algorithm customized for document images, which extracts the rough region proposals at the initial stage and prunes them later.…”
Section: Related Workmentioning
confidence: 99%
“…Most of the early approaches heavily relied on heuristics-which are task-specific-and thus fail to generalize to novel scenarios [7,8]. Deep-learning based models have been leveraged for this segmentation in the more recent past [2,[9][10][11][12]. All of these methods involve a significant amount of pre or post-processing based on hand-designed heuristics.…”
Section: Introductionmentioning
confidence: 99%
“…In both papers the number of training items is relatively high and the results are evaluated only considering the accuracy of the model without taking into account the recall. Other authors used Faster R-CNN for page layout identification [18], for comic character face detection [15], and for arrow localization on handwritten industrial inspection sheets [5].…”
Section: Previous Workmentioning
confidence: 99%