Proposing a step-change in preservation system architectures
Structured AbstractPurpose To consider how Digital Preservation system architectures will support Business Analysis of large-scale collections of preserved resources, and the use of Big Data analyses by future researchers.
Design / methodology / approachArchitectural reviews of existing systems. Experimental surveys of large digital collections using existing Digital Preservation tools at Big Data scales. Design of a proposed new architecture to work with Big Data volumes of preserved digital resources -also based upon experience of managing a collection of 30 million digital images.
FindingsModern visualisation tools enable Business Analyses based on file-related metadata, but most currentlyavailable systems need more of this functionality 'out-of-the-box'. Scalability of preservation architecture to Big Data volumes depends upon the ability to run preservation processes in parallel, so indexes that enable effective sub-division of collections are vital. Not all processes scale easily: those that don't require complex management.
In the seventeenth century, Robert Burton wrote in his Anato my of Melancholy that a study of maps was a good antidote for melancholy: ((What greater pleasure can there now be, than to view those elaborate maps of Ortelius, Mercator, Hondius, &c.? To peruse those books of cities put out by Braunus and Hogen bergius? " What, indeed? This catalogue then may be your guide to travel at the lowest price and least discomfort to that new world contained within these maps.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.