Proposing a step-change in preservation system architectures
Structured AbstractPurpose To consider how Digital Preservation system architectures will support Business Analysis of large-scale collections of preserved resources, and the use of Big Data analyses by future researchers.
Design / methodology / approachArchitectural reviews of existing systems. Experimental surveys of large digital collections using existing Digital Preservation tools at Big Data scales. Design of a proposed new architecture to work with Big Data volumes of preserved digital resources -also based upon experience of managing a collection of 30 million digital images.
FindingsModern visualisation tools enable Business Analyses based on file-related metadata, but most currentlyavailable systems need more of this functionality 'out-of-the-box'. Scalability of preservation architecture to Big Data volumes depends upon the ability to run preservation processes in parallel, so indexes that enable effective sub-division of collections are vital. Not all processes scale easily: those that don't require complex management.
PurposeThis paper aims to investigate the requirements for automating aspects of the appraisal process for digital objects. It explores these requirements in the context of automating re‐appraisal and questions many of the assumptions commonly made about appraisal and about automating the processes needed for life‐cycle management of digital objects.Design/methodology/approachThe literature of digital preservation and curation and the experience of one of the authors in planning to implement a digital archive at the Wellcome Library are the basis of an exploration of issues.FindingsThe development of automated appraisal systems and associated tools is a worthwhile endeavour, although the complexity and cost associated with designing, developing and implementing them may be prohibitive in some situations. An automated appraisal system may, however, have only limited benefits in some contexts. The re‐appraisal of technical attributes of digital materials, which is an essential part of their management, is a prime contender for some level of automation. The approach proposed has limitations which arise from such factors as metadata requirements and trustworthiness.Originality/valueThe paper articulates assumptions made about automation and applies these in order to gain a better understanding of the requirements of automating aspects of appraisal in a digital archive.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.