2011
DOI: 10.1007/978-3-642-23160-5_1

Efficient Search in Hidden Text of Large DjVu Documents

Abstract: The paper describes an open-source tool that makes it possible to present end users with the results of advanced language technologies. It relies on the DjVu format, which for some applications is still superior to other modern formats, including PDF/A. The GPL-licensed DjVu tools are not limited to the DjVuLibre library, but are being supplemented by various new programs, such as pdf2djvu, developed by Jakub Wilk. In particular, pdf2djvu makes it possible to convert the PDF output of popular OCR programs such as FineReader to DjVu, preserving the hi…

citations
Cited by 4 publications
(2 citation statements)
references
References 3 publications
(2 reference statements)
“…It was designed by the present author and implemented by Jakub Wilk. The reasons for selecting the DjVu format were presented on several occasions, such as (Bień, 2009) and (Bień, 2011). In this paper we focus on problems specific to the IMPACT data.…”
Section: Introduction (mentioning)
confidence: 99%
“…On the contrary, Poliqarp is currently no longer updated but, at the same time it is still under use in several projects [13][14][15].…”
Section: System Architecture and Components (mentioning)
confidence: 99%