This paper summarizes the 3rd Book Structure Extraction competition that was run at the ICDAR 2013. Its goal is to evaluate and compare automatic techniques for deriving structure information from digitized books, which could then be used to aid navigation inside the books. More specifically, the task that participants are faced with is to construct hyperlinked tables of contents for a collection of 1,000 digitized books. This paper reviews the setup of the competition, the book collection used in the task, and the measures used for the evaluation. The main novelty of the 2013 competition is that we were able to rely on an external provider for the ground truthing phase, hence granting the consistency of the evaluation. In addition, this permitted to nearly double the number of annotated books from the 1,040 books annotated in 2009 and 2011 to over 2,000 books. The paper further presents the result performance of the 6 participating research teams, and briefly summarizes their approaches.
While document delivery of journal articles has seen some remarkable progress in the last years, interlibrary loan (ILL) of monographs is still being carried out in the way it started from its very beginnings in the 19 th century; to satisfy the need of a single user the book is delivered from site to site by a postal service. In contrast to this traditional method the BOOKS2U! project will investigate the possibility of a new approach where the requested monograph is digitised and the digital surrogates are made available on a digital library Website. The project is funded by the European Union under the Information Society Technologies Program and carried out by the libraries of the Universities of Graz and Innsbruck and the department of Computer Science for the Blind (i3s3) of the University of Linz i ,which together are forming the ALO (Austrian Literature Online) working group. The results of the project will be summarised in a feasibility study ii .
The tranScriptorium project aims to develop innovative, efficient and cost-effective solutions for annotating handwritten historical documents using modern, holistic Handwritten Text Recognition (HTR) technology. Three actions are planned in tranScriptorium: i) improve basic image preprocessing and holistic HTR techniques; ii) develop novel indexing and keyword searching approaches; and iii) capitalize on new, user-friendly interactive-predictive HTR approaches for computer-assisted operation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.