2020
DOI: 10.1007/s11042-020-09832-3
|View full text |Cite
|
Sign up to set email alerts
|

BINYAS: a complex document layout analysis system

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(9 citation statements)
references
References 19 publications
0
9
0
Order By: Relevance
“…By contrast, bottom-up methods are able to handle more kinds of documents including namely non-Manhattan layout pages and so on, but they need higher computational cost as an exchange. Hybrid methods integrate these two methods, and one of the most representative methods is CC analysis [7], [8], [9], [10], [11], [12]: CCs are detected from the the entire images first, and then researchers analyze these CCs to acquire areas of interest. Hybrid methods combine the benefits of bottom-up and top-down methods, they can handle a variety of documents with relatively fast speed.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…By contrast, bottom-up methods are able to handle more kinds of documents including namely non-Manhattan layout pages and so on, but they need higher computational cost as an exchange. Hybrid methods integrate these two methods, and one of the most representative methods is CC analysis [7], [8], [9], [10], [11], [12]: CCs are detected from the the entire images first, and then researchers analyze these CCs to acquire areas of interest. Hybrid methods combine the benefits of bottom-up and top-down methods, they can handle a variety of documents with relatively fast speed.…”
Section: Discussionmentioning
confidence: 99%
“…To better distinguish small texts between large graphics, existing studies usually fill contours by different means: filling the whole regions according to different rules [11], [12], [9] or performing hole-filled morphological closing [10]. However, there are two problems in our scenario that filling is inapplicable to.…”
Section: A Coarse Segmentationmentioning
confidence: 99%
“…Image processing techniques and convolutional neural networks are commonly used in the literature for this purpose [7]- [9]. โ€ข Optical Character Recognition (OCR): The obtained images are converted into digitized text by OCR.…”
Section: โ€ข Text and Table Detection: Extraction Of Informationmentioning
confidence: 99%
“…Precision, Recall, F-Score, and Accuracy, which are commonly used in the literature, are used as a performance metrics to evaluate the success of the system. The Precision, Recall, F-Score, and Accuracy values were calculated using equation ( 5), ( 6), (7), and ( 8) respectively.…”
Section: ๐‘๐ถ๐ธ๐‘… = 100 ๐‘† + ๐ท + ๐ผ ๐ป + ๐‘† + ๐ท + ๐ผmentioning
confidence: 99%
“…Researchers have also used Haar Discrete Wavelet Transform (DWT) for segmenting text from document images by detecting the edges and then using the line feature, vector graph based on the edge map and the stroke, and finally, the text is segmented by line feature [12]. In [13] classification of text and non-text components is performed using connected components and pixel based approach. Statistical approaches have also been used for text and non-text classification on handwritten documents [14] and works on the extraction of text and graphics from different scripts of newspapers [15].…”
Section: Introductionmentioning
confidence: 99%