2020
DOI: 10.31235/osf.io/5djcn
Preprint

Cultural Cartography with Word Embeddings

Abstract:

Using the presence or frequency of keywords is a classic approach in the formal analysis of text, but has the drawback of glossing over the relationality of word meanings. Word embedding models overcome this problem by constructing a standardized meaning space where words are assigned a location based on relations of similarity to, and difference from, other words based on how they are used in natural language samples. We show how word embeddings can be put to the task of interpretation via two kinds of nav…

Cited by 10 publications (16 citation statements)
References 124 publications (178 reference statements)
“…Then, a word embedding model (skip-gram model) analyzed the closest words around the identified terms (e.g., we, Marylanders, people) per state, creating standardized values of closeness. Word embedding models are recognized as being particularly well-suited for text analysis focused on meaning (Nelson 2021; Stoltz and Taylor 2021). Thus, the composite measure describes a set of words that revolve around context-specific plural words that represent the construction of collective intentionality in the press conferences.…”
Section: Methods
confidence: 99%
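The “standardized values of closeness” described above are typically cosine similarities between word vectors. A minimal sketch of ranking the closest words to a seed term, using a toy, randomly initialized embedding matrix (the vocabulary and vectors are illustrative assumptions, not the study’s actual model):

```python
import numpy as np

# Toy embedding matrix; vocabulary and vectors are illustrative only.
vocab = ["we", "marylanders", "people", "together", "economy", "virus"]
rng = np.random.default_rng(42)
emb = rng.normal(size=(len(vocab), 4))
emb /= np.linalg.norm(emb, axis=1, keepdims=True)  # normalize to unit length

def closest(term, k=3):
    """Rank words by cosine similarity to `term`; because all vectors
    are unit length, the dot product is a standardized score in [-1, 1]."""
    i = vocab.index(term)
    sims = emb @ emb[i]              # cosine similarity with every word
    order = np.argsort(-sims)        # most similar first
    return [(vocab[j], float(sims[j])) for j in order if j != i][:k]

print(closest("we"))
```

With a trained model, the same ranking over the full vocabulary yields the neighborhood of words that the quoted passage uses to characterize each term’s context of use.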
“…This transformation, which mirrors the distributional hypothesis of language, allows moving problems of semantic similarity from a frequentist framework (“how often were certain terms used”) to a geometric one (“how close are certain terms with respect to their contexts of semantic use”) (Kozlowski et al. 2019). The process is ‘cartographic’ (Stoltz and Taylor 2020), identifying similarities in the use of the tokens in a corpus rather than “actual” meanings. It is, however, powerful for describing how linguistic units evolve over time in their context of use (Bizzoni et al. 2019; Hamilton, Leskovec, and Jurafsky 2016; Szymanski 2017) and in relation to terms associated with key social cleavages (Garg et al. 2018; Rice, Rhodes, and Nteta 2019).…”
Section: Scaffolding
confidence: 99%
“…We use a word embedding to simulate the actors within the transmission chains. Word embeddings model the meaning of words by representing them as vectors, so that words that appear in semantically similar contexts are close to one another in the embedding space (for a sociological adaptation, see Kozlowski et al. 2019; Stoltz and Taylor 2020). A common approach to estimating word vectors is the Word2Vec algorithm (Mikolov, Sutskever, et al. 2013), which uses an artificial neural network to learn word vectors by repeatedly (1) taking a passage from the corpus, (2) omitting a word from that passage, (3) attempting to guess the missing word based on the vectors of the context words, (4) assessing the correctness of its guess, and (5) adjusting the word vectors to make future guesses more accurate.…”
Section: Simulating an Actor Using a Word Embedding
confidence: 99%
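The predict-and-adjust loop described above can be sketched in plain NumPy. This is an illustrative skip-gram variant with a full-softmax objective on a toy corpus, not the cited authors’ implementation; a production model (e.g., Gensim’s Word2Vec) uses negative sampling and far larger corpora:

```python
import numpy as np

# Toy corpus; a real model trains on large natural-language samples.
corpus = "the cat sat on the mat the dog sat on the rug".split()
vocab = sorted(set(corpus))
idx = {w: i for i, w in enumerate(vocab)}
V, D = len(vocab), 8
rng = np.random.default_rng(0)
W_in = rng.normal(scale=0.1, size=(V, D))   # word (input) vectors
W_out = rng.normal(scale=0.1, size=(V, D))  # context (output) vectors

def softmax(z):
    e = np.exp(z - z.max())  # shift for numerical stability
    return e / e.sum()

lr, window = 0.05, 2
for epoch in range(200):
    for t, center in enumerate(corpus):
        c = idx[center]
        # Steps (1)-(2): take a window and pick each neighbor to predict.
        for off in range(-window, window + 1):
            if off == 0 or not (0 <= t + off < len(corpus)):
                continue
            o = idx[corpus[t + off]]
            # Step (3): guess via softmax over dot products.
            p = softmax(W_out @ W_in[c])
            # Step (4): error signal = predicted minus actual (one-hot).
            err = p.copy()
            err[o] -= 1.0
            # Step (5): gradient updates to both vector tables.
            grad_in = W_out.T @ err
            W_out -= lr * np.outer(err, W_in[c])
            W_in[c] -= lr * grad_in

def cos(a, b):
    """Cosine similarity between the learned vectors of two words."""
    va, vb = W_in[idx[a]], W_in[idx[b]]
    return float(va @ vb / (np.linalg.norm(va) * np.linalg.norm(vb)))

print(cos("cat", "dog"), cos("cat", "on"))
```

After training, words used in similar contexts (here, "cat" and "dog" both follow "the" and precede "sat") should drift toward one another in the embedding space, which is what lets the cited simulation treat proximity as shared meaning.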