2015
DOI: 10.1007/s00530-015-0484-3
|View full text |Cite
|
Sign up to set email alerts
|

Fast document image comparison in multilingual corpus without OCR

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
5
0
1

Year Published

2016
2016
2022
2022

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(6 citation statements)
references
References 23 publications
0
5
0
1
Order By: Relevance
“…In [137], dense SIFT descriptors were used. Segmentation of the document image not only into text lines but also into characters, as proposed in [139], can also be used. The results presented in this work show that the proposed method can handle images of multilingual documents with different resolutions and font sizes.…”
Section: Document Comparisonmentioning
confidence: 99%
“…In [137], dense SIFT descriptors were used. Segmentation of the document image not only into text lines but also into characters, as proposed in [139], can also be used. The results presented in this work show that the proposed method can handle images of multilingual documents with different resolutions and font sizes.…”
Section: Document Comparisonmentioning
confidence: 99%
“…The first group of methods is used to compare texts of electronic versions of documents, so such methods are not adapted for comparing images of documents, but can detect not only structural but also semantic modifications [4]. The second group is used for the comparison of document images without character recognition, in particular for layout comparison of documents [5,6]. Disadvantages of these methods are that only the spatial content of the document is compared [6].…”
Section: Introductionmentioning
confidence: 99%
“…Disadvantages of these methods are that only the spatial content of the document is compared [6]. Testing the methods described in [5] was not carried out on public dataset.…”
Section: Introductionmentioning
confidence: 99%
“…Specifically, this special issue targeted the most recent technical progresses on learning techniques for high-dimensional multimedia data, including classification [1], segmentation [2,4], feature selection [1,5,8], deep learning [5], image saliency detection [7], and many others [3,6], in many kinds of learning-based applications, including image processing sequences [1,2,7,8], text processing [1,4,6], system applications [6]. The topics of the special issue are interesting, so in total, this special issue have received 24 submissions from at least 20 different research departments over the world.…”
mentioning
confidence: 99%
“…The paper by Lin et al [4] proposed to compare document images in multilingual corpus, which was composed of character segmentation, feature extraction and similarity measure. The paper applied projection and self-adaptive threshold to analyze the layout and then segment the text line by horizontal projection.…”
mentioning
confidence: 99%