Measuring how much two documents differ is a basic task in the quantitative analysis of text. Because difference is a complex, interpretive concept, researchers often operationalize difference as distance, a mathematical function that represents documents through a metaphor of physical space. Yet the constraints of that metaphor mean that distance can only capture some of the ways that documents can relate to each other. We show how a more general concept, divergence, can help solve this problem, alerting us to new ways in which documents can relate to each other. In contrast to distance, divergence can capture enclosure relationships, where two documents differ because the patterns found in one are a partial subset of those in the other, and the emergence of shortcuts, where two documents can be brought closer through mediation by a third. We provide an example of this difference measure, Kullback-Leibler Divergence, and apply it to two worked examples: the presentation of scientific arguments in Charles Darwin's Origin of Species (1859) and the rhetorical structure of philosophical texts by Aristotle, David Hume, and Immanuel Kant. These examples illuminate the complex relationship between time and what we refer to as an archive's "enclosure architecture", and show how divergence can be used in the quantitative analysis of historical, literary, and cultural texts to reveal cognitive structures invisible to spatial metaphors. Those who study culture look for differences. Just as ethnographers might study the differences between the practices of regions, villages, or families, 1 literary scholars might study the differences between genres, modes, or periods. 2 Those who approach culture from a quantitative standpoint are no exception, and we are often tasked with the goal of measuring the differences between different, computationally identified, patterns of expression. 3 In the digital analysis of texts, for example, we might ask how much two documents or sets of documents differ, and relate these differences to other aspects of psychological or social life. 4
Feminism of the 1970s remains among the most influential social movements within the United States. Bestselling texts played a crucial role in spreading feminism beyond early activists into the mainstream of American society. Contemporary scholars of feminism continue to rely on these works as pivotal historical sources. This paper utilizes quantitative methods to compare six feminist bestsellers from 1970. Our data consists of three subcorpora of digitized books published in 1970 found in the Hathi Trust: six feminist bestsellers, a sample of non-fiction, and a sample of writing about women. Computational textual analysis identifies each bestselling title’s salient features and the contributions each text made at this key moment in the development of feminist thought. These results led us to propose a historiographical intervention that credits one bestseller, The Black Woman, with a more prominent role in the development of 1970s feminism.
No abstract
No abstract
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.