Lise Jaillant scite author profile

Access to data is seen as a key priority today. Yet, the vast majority of digital cultural data preserved in archives is inaccessible due to privacy, copyright or technical issues. Emails and other born-digital collections are often uncatalogued, unfindable and unusable. In the case of documents that originated in paper format before being digitised, copyright can be a major obstacle to access. To solve the problem of access to digital archives, cross-disciplinary collaborations are absolutely essential. The big challenges of our time—from global warming to social inequalities—cannot be solved within a single discipline. The same applies to the challenge of “dark” archives closed to users. We cannot expect archivists or digital humanists to find a magical solution that will instantly make digital records more accessible. Instead, we need to set up collaborations across disciplines that seldom talk to each other. Based on 21 interviews with 26 archivists, librarians and other professionals in cultural institutions, we identify key obstacles to making digitised and born-digital collections more accessible to users. We outline current levels of access to a wide range of collections in various cultural organisations, including no access at all and limited access (for example, when users are required to travel on-site to consult documents). We suggest possible solutions to the problems of access—including the ethical use of Artificial Intelligence to unlock “dark” archives inaccessible to users. Finally, we propose the creation of a global user community who would participate in decisions on access to digital collections.

show abstract

Unlocking digital archives: cross-disciplinary perspectives on AI and born-digital data

Jaillant

Caputo

2022

AI & Soc

View full text Add to dashboard Cite

Co-authored by a Computer Scientist and a Digital Humanist, this article examines the challenges faced by cultural heritage institutions in the digital age, which have led to the closure of the vast majority of born-digital archival collections. It focuses particularly on cultural organizations such as libraries, museums and archives, used by historians, literary scholars and other Humanities scholars. Most born-digital records held by cultural organizations are inaccessible due to privacy, copyright, commercial and technical issues. Even when born-digital data are publicly available (as in the case of web archives), users often need to physically travel to repositories such as the British Library or the Bibliothèque Nationale de France to consult web pages. Provided with enough sample data from which to learn and train their models, AI, and more specifically machine learning algorithms, offer the opportunity to improve and ease the access to digital archives by learning to perform complex human tasks. These vary from providing intelligent support for searching the archives to automate tedious and time-consuming tasks. In this article, we focus on sensitivity review as a practical solution to unlock digital archives that would allow archival institutions to make non-sensitive information available. This promise to make archives more accessible does not come free of warnings for potential pitfalls and risks: inherent errors, "black box" approaches that make the algorithm inscrutable, and risks related to bias, fake, or partial information. Our central argument is that AI can deliver its promise to make digital archival collections more accessible, but it also creates new challenges - particularly in terms of ethics. In the conclusion, we insist on the importance of fairness, accountability and transparency in the process of making digital archives more accessible.

show abstract

Introduction: Global Modernism

Jaillant¹,

Martin²

2018

Modernist Cultures

View full text Add to dashboard Cite

Introduction

Jaillant¹

2022

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Lise Jaillant

After the digital revolution: working with emails and born-digital records in literary and publishers’ archives

How can we make born-digital and digitised archives more accessible? Identifying obstacles and solutions

Unlocking digital archives: cross-disciplinary perspectives on AI and born-digital data

Introduction: Global Modernism

Introduction

Contact Info

Product

Resources

About