The subjectivities that shape data collection and management have received extensive criticism, especially with regard to the digitization projects and digital archives of galleries, libraries, archives and museums (GLAM institutions). The role of digital methods in recovering data absences is also receiving increasing attention. Conceptualizing the absence of non-hegemonic individuals from the catalogues of Sir Hans Sloane as an instance of textual haunting, this article will ask: to what extent do data-driven approaches further entrench archival absences and silences? Can digital approaches be used to highlight or recover absent data? This article will give a decisive overview of relevant literature and projects so as to examine how digital tools are being realigned to recover, or more modestly acknowledge, the vast, undocumented network of individuals who have been omitted from canonical histories. Drawing on the example of Sloane, this article will reiterate the importance of a more rigorous ethics of digital practice, and propose recommendations for the management and representation of historical data, so that cultural heritage institutions and digital humanists may better inform users of the absences and subjectivities that shape digital datasets and archives. This article is built on a comprehensive survey of digital humanities’ current algorithmic approaches to absence and bias. It also presents reflections on how we, the authors, grappled with unforeseen questions of absence and bias during a Leverhulme-funded collaboration between the British Museum and University College London (UCL), entitled ‘Enlightenment Architectures: Sir Hans Sloane’s Catalogues of his collections’.
Purpose: By mapping out the capabilities, challenges and limitations of named-entity recognition (NER), this article aims to synthesise the state of the art of NER in the context of the early modern research field and to inform discussions about the kind of resources, methods and directions that may be pursued to enrich the application of the technique going forward.
Design/methodology/approach: Through an extensive literature review, this article maps out the current capabilities, challenges and limitations of NER and establishes the state of the art of the technique in the context of the early modern, digitally augmented research field. It also presents a new case study of NER research undertaken by Enlightenment Architectures: Sir Hans Sloane's Catalogues of his Collections (2016–2021), a Leverhulme-funded research project and collaboration between the British Museum and University College London, with contributing expertise from the British Library and the Natural History Museum.
Findings: Currently, it is not possible to benchmark the capabilities of NER as applied to documents of the early modern period. The authors also draw attention to the situated nature of authority files, and current conceptualisations of NER, leading them to the conclusion that more robust reporting and critical analysis of NER approaches and findings is required.
Research limitations/implications: This article examines NER as applied to early modern textual sources, which are mostly studied by Humanists. As addressed in this article, detailed reporting of NER processes and outcomes is not necessarily valued by the disciplines of the Humanities, with the result that it can be difficult to locate relevant data and metrics in project outputs. The authors have tried to mitigate this by contacting projects discussed in this paper directly, to further verify the details they report here.
Practical implications: The authors suggest that a forum is needed where tools are evaluated according to community standards. Within the wider NER community, the MUC and CoNLL corpora are used for such experimental set-ups and are accompanied by a conference series, and may be seen as a useful model for this. The ultimate nature of such a forum must be discussed with the whole research community of the early modern domain.
Social implications: NER is an algorithmic intervention that transforms data according to certain rules, patterns or training data and ultimately affects how the authors interpret the results. The creation, use and promotion of algorithmic technologies like NER is not a neutral process, and neither is their output. This paper calls for a more critical understanding of the role and impact of NER on early modern documents and research, and for a focus on some of the data- and human-centric aspects of NER routines that are currently overlooked.
Originality/value: This article presents a state-of-the-art snapshot of NER, its applications and potential, in the context of early modern research. It also seeks to inform discussions about the kinds of resources, methods and directions that may be pursued to enrich the application of NER going forward. It draws attention to the situated nature of authority files, and current conceptualisations of NER, and concludes that more robust reporting of NER approaches and findings is urgently required. The Appendix sets out a comprehensive summary of digital tools and resources surveyed in this article.
This article explores the library catalogues of the physician, President of the Royal Society, and omnivorous collector Sir Hans Sloane (1660–1753). It draws attention to how Sloane organised and catalogued his books, and situates his approach within the wider context of late seventeenth- and early eighteenth-century English book collecting and library documentation. It highlights the importance of Gabriel Naudé’s Advis pour dresser une bibliothèque and the wider genre of works on library management for English collectors, and uncovers the library practices shared by Sloane and his peers. The article concludes that Sloane adhered to many common conventions of library building, many of which were communicated through literature addressing library organisation and bibliography. However, he balanced the desire for the ideal type library with the practical demands of book ownership, mediating best practice with pragmatic solutions.