2010
DOI: 10.1007/s10032-010-0135-3
|View full text |Cite
|
Sign up to set email alerts
|

Digital weight watching: reconstruction of scanned documents

Abstract: A web portal providing access to over 250.000 scanned and OCRed cultural heritage documents is analyzed. The collection consists of the complete Dutch Hansard from 1917 to 1995. Each document consists of facsimile images of the original pages plus hidden OCRed text. The inclusion of images yields large file sizes of which less than 2% is the actual text. The search user interface of the portal provides poor ranking and not very informative document summaries (snippets). Thus, users are instrumental in weeding … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2011
2011
2023
2023

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 16 publications
0
1
0
Order By: Relevance
“…The category ‘others’ includes 11% of the selected works that do not fall in any of the aforementioned categories. These works propose techniques such as the reconstruction of documents from digitised images to improve their readability [76], large interactive displays for data exploration and analysis [62], multimedia edition of geospatial narratives [37] and digital storytelling [43].…”
Section: Resultsmentioning
confidence: 99%
“…The category ‘others’ includes 11% of the selected works that do not fall in any of the aforementioned categories. These works propose techniques such as the reconstruction of documents from digitised images to improve their readability [76], large interactive displays for data exploration and analysis [62], multimedia edition of geospatial narratives [37] and digital storytelling [43].…”
Section: Resultsmentioning
confidence: 99%