2016 12th IAPR Workshop on Document Analysis Systems (DAS) 2016
DOI: 10.1109/das.2016.67

Text Extraction in Document Images: Highlight on Using Corner Points

Abstract: During past years, text extraction in document images has been widely studied in the general context of Document Image Analysis (DIA) and especially in the framework of layout analysis. Many existing techniques rely on complex processes based on preprocessing, image transforms or component/edges extraction and their analysis. At the same time, text extraction inside videos has received an increased interest and the use of corner or key points has been proven to be very effective. Because it is noteworthy to no…

Citations: cited by 19 publications (8 citation statements)
References: 30 publications
“…4. Though the technique is simple compared to other proposed [37,38], it gives an excellent result with 100% accuracy. Fig.…”
Section: Checking System (mentioning)
confidence: 97%
“…MAP, for a set of queries, is the mean of the average precision scores for each query. To define the score, we first define Precision ( ) and Recall ( ), as in Equations (18) and (19), respectively.…”
Section: Performance Measure (mentioning)
confidence: 99%
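The definition quoted above (MAP as the mean over queries of each query's average precision) can be illustrated with a minimal sketch. The citing paper's Equations (18) and (19) are not reproduced on this page, so the sketch assumes one common convention: average precision is the sum of precision values at the ranks where a relevant item appears, divided by the number of relevant items for that query. The function names `average_precision` and `mean_average_precision` are hypothetical, and each ranked list is assumed to contain no duplicates.

```python
def average_precision(ranked, relevant):
    """Average precision for one query (assumed convention, see lead-in).

    ranked:   list of retrieved items, best first, without duplicates.
    relevant: collection of items that are truly relevant to the query.
    """
    relevant = set(relevant)
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            # Precision at this rank = relevant items seen so far / rank.
            precision_sum += hits / rank
    # Normalise by the number of relevant items (assumption, not from the source).
    return precision_sum / len(relevant)


def mean_average_precision(queries):
    """MAP: the mean of the per-query average precision scores.

    queries: list of (ranked, relevant) pairs, one pair per query.
    """
    if not queries:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in queries) / len(queries)
```

For example, a query that retrieves `["a", "b", "c", "d"]` with relevant items `{"a", "c"}` has precision 1/1 at rank 1 and 2/3 at rank 3, so its average precision under this convention is (1 + 2/3) / 2.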
“…Here, it should be noted that the techniques that extract words from document images might be erroneous sometimes and can add some computational overhead if a segmentation-based approach is adopted. However, the state-of-the-art word extraction methods [18, 19, 20] that perform the tasks with good efficiency on complex documents, while consuming less time, could be used to get rid of these issues. For a similar reason, text line-based techniques are computationally more expensive when compared to word-based methods.…”
Section: Introduction (mentioning)
confidence: 99%
“…The algorithm can filter the background, which contains text image of printed documents. Vikas and Ragot [16] designed a very simple technique based on FAST key points to extract texts from document images. The image is divided into blocks and the point density of each block is computed.…”
Section: A Layout Segmentation (mentioning)
confidence: 99%
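The last statement describes the approach as dividing the image into blocks and computing the key point density of each block. A minimal sketch of that step follows, under stated assumptions: key point coordinates are taken as given (in practice they might come from a FAST corner detector), density is defined here as points per pixel of block area, and the final thresholding step that labels a block as text is an illustrative assumption rather than something the quoted text specifies. The names `block_point_density` and `text_block_mask` are hypothetical.

```python
def block_point_density(points, width, height, block_size):
    """Count key points per block and normalise by block area.

    points:     iterable of (x, y) key point coordinates in pixels.
    width:      image width in pixels.
    height:     image height in pixels.
    block_size: side length of the square blocks in pixels.
    Returns a list of rows, each a list of densities (points per pixel).
    """
    cols = (width + block_size - 1) // block_size   # ceiling division
    rows = (height + block_size - 1) // block_size
    counts = [[0] * cols for _ in range(rows)]
    for x, y in points:
        # Ignore points that fall outside the image.
        if 0 <= x < width and 0 <= y < height:
            counts[int(y) // block_size][int(x) // block_size] += 1

    density = []
    for r in range(rows):
        row = []
        for c in range(cols):
            # Blocks on the right and bottom edges may be smaller than block_size.
            block_w = min(block_size, width - c * block_size)
            block_h = min(block_size, height - r * block_size)
            row.append(counts[r][c] / (block_w * block_h))
        density.append(row)
    return density


def text_block_mask(density, threshold):
    """Mark blocks whose density reaches the threshold (assumed decision rule)."""
    return [[d >= threshold for d in row] for row in density]
```

With an 8 by 8 image, 4 pixel blocks, and three points inside the top-left block, that block's density is 3 / 16; a threshold of 0.1 would then flag it while leaving emptier blocks unflagged. The threshold value is arbitrary and would need tuning on real data.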