Provenance disclosure—the documentation of an artifact’s origin and how it was produced—is an important aspect to consider when working with historical records which undergo multiple transformations in preparation for and during digitization. Provenance in this context is commonly communicated through explanatory text or static diagrams. However, the methodological and curatorial decisions that have influenced the records’ data are easily overlooked, in particular when exploring the records through visualization as a result of digitization processes. We propose a data-driven approach to provenance disclosure which (1) traces provenance back to when the records were created, (2) documents and categorizes the records’ transformations (transcriptions, content modifications, changes in organization, and representational form), and (3) uses data visualization to disclose provenance in interactive ways. We reflect on how this approach can be practically applied in the context of historical record collections, and we present findings from a qualitative study we conducted to investigate the merits and limitations of provenance-driven visualization. Our findings suggest that data-driven provenance disclosure has the potential to (1) promote transparency and deeper interpretations of historical records, (2) provide rigor in researching historical document collections and underlying production processes, and (3) encourage ethical considerations by making visible labor and implicit bias that influence the production and curation of historical records.
Transcription, annotation, digitization and/or visualization are common transformations that historical documents such as national records, birth/death registers, university records, letters or books undergo. Reasons for those transformations span from the (physical) protection of the original materials to disclosure of "hidden" information or patterns within the documents. Even though such transformations bring new insights and perspectives on the documents, they also modify the documents' content, structure, and/or artifactual form and thus, occlude prior knowledge and interpretation. When it comes to visualization as a means to transform historical documents from written to abstract visual form, there is typically little acknowledgment or even understanding of the previous transformation steps these documents have gone through. The "tremendous rhetorical force" [3] of visualization, we argue, should not be at the expense of the multiple pasts, contexts, and curators that are inherent in historical record collections. Rather, the urgent question for the fields of visualization and the (digital) humanities is how to better support awareness of these multiple layers of interpretation and the people behind them when representing historical documents. We begin to address this question based on a collection of historical university records by (a) investigating common transformation processes of historical documents, and (b) discussing opportunities and challenges for making such transformations transparent through what we call "provenance-driven visualization"; the idea for a visualization that makes visible the layers of transformation (including interpretation, re-structuring, and curation) inherent in historical documents.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.