Topics, Concepts, and Measurement: A Crowdsourced Procedure for Validating Topics as Measures

Ying, Luwei; Montgomery, Jacob M.; Stewart, Brandon

doi:10.1017/pan.2021.33

Cited by 35 publications

(33 citation statements)

References 51 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…UML methods cluster data without relying on manual coding, finding similar words or texts in the data and grouping them together . While they are best suited for exploratory analyses, these methods can be repurposed to measurement of concepts of interest in text Ying, Montgomery, and Stewart 2022). Here we assess their performance in capturing subtle internal states in short texts using the following procedure: (1) we use UML models to group texts, (2) examine whether some of these groups correspond to our coding categories and, if so, (3) code texts that belong to these groups for corresponding categories.…”

Section: Step 2-methods Choicementioning

confidence: 99%

“…Here we assess their performance in capturing subtle internal states in short texts using the following procedure: (1) we use UML models to group texts, (2) examine whether some of these groups correspond to our coding categories and, if so, (3) code texts that belong to these groups for corresponding categories. While we rely on our interpretation and expertise in matching groups to coding categories, researchers can resort to more sophisticated methods for the evaluation, validation, and labelling of results obtained with UML models (Ying et al 2022). In this overview, we survey several UML algorithms.…”

Section: Step 2-methods Choicementioning

confidence: 99%

See 1 more Smart Citation

A Systematic Evaluation of Text Mining Methods for Short Texts: Mapping Individuals’ Internal States from Online Posts

Macanovic¹,

Przepiorka²

2022

Preprint

View full text Add to dashboard Cite

Sociologists have successfully used text mining to investigate discourse using news articles, official documents, and other sources. Yet, the potential of exploring millions of short texts generated spontaneously by individuals in online environments has remained untapped within the field. To fill this gap, we show how such texts can inform sociologists about individual internal states such as norms, motives, and stances, which thus far have been mainly elicited using surveys. We assess the performance of 581 variations of three text mining approaches–dictionary methods, supervised, and unsupervised machine learning–against the benchmark of texts coded by humans for complex schemes capturing individuals’ internal states. Our analysis includes coding feedback texts from an online market for motives for leaving feedback (N = 2,000) and tweet texts for moral values expressed in text (N = 3,832). We describe challenges arising with these different approaches and provide best-practice advice for future applications.

show abstract

Section: Step 2-methods Choicementioning

confidence: 99%

Section: Step 2-methods Choicementioning

confidence: 99%

A Systematic Evaluation of Text Mining Methods for Short Texts: Mapping Individuals’ Internal States from Online Posts

Macanovic¹,

Przepiorka²

2022

Preprint

View full text Add to dashboard Cite

show abstract

“…The researchers appreciate that there are existing topic modelling techniques and machine learning approaches for computationally extracting topics and classifying users 34 – 36 . However, the researchers have gone through the pain of manually annotating the data because of our research interest to further develop knowledge about the identified topics and user categories.…”

Section: Methodsmentioning

confidence: 99%

Twitter data from the 2019–20 Australian bushfires reveals participatory and temporal variations in social media use for disaster recovery

Ogie

Moore

Wickramasuriya

et al. 2022

Sci Rep

View full text Add to dashboard Cite

Social media platforms have proved to be vital sources of information to support disaster response and recovery. A key issue, though, is that social media conversation about disasters tends to tail off after the immediate disaster response phase, potentially limiting the extent to which social media can be relied on to support recovery. This situation motivates the present study of social media usage patterns, including who contributes to social media around disaster recovery, which recovery activities they contribute to, and how well that participation is sustained over time. Utilising Twitter data from the 2019–20 Australian bushfires, we statistically examined the participation of different groups (citizens, emergency agencies, politicians and others) across categories of disaster recovery activity such as donations & financial support or mental health & emotional support, and observed variations over time. The results showed that user groups differed in how much they contributed on Twitter around different recovery activities, and their levels of participation varied with time. Recovery-related topics also varied significantly with time. These findings are valuable because they increase our understanding of which aspects of disaster recovery currently benefit most from social media and which are relatively neglected, indicating where to focus resources and recovery effort.

show abstract

“…This path forward is promising but also entails risks, because there is no guarantee that an algorithm operating on word frequencies will arrive at a meaningful definition of a cultural category. For this reason, unsupervised methods to summarize text data always place the burden on the researcher to justify their chosen interpretation and validate the utility of the topics learned (Grimmer et al, 2022;Grimmer and Stewart, 2013;Ying et al, 2021). LDA thus illustrates a key idea that applies more broadly to unsupervised methods: while these methods may appear to inductively discover insights from the data alone, they actually involve extensive theoretical work on the part of the researcher to justify and interpret the result.…”

Section: Dimension Reduction: Unsupervised Machine Learning Can Summa...mentioning

confidence: 99%

Researcher reasoning meets computational capacity: Machine learning for social science

Lundberg¹,

Brand²,

Jeon³

2022

Social Science Research

View full text Add to dashboard Cite

Topics, Concepts, and Measurement: A Crowdsourced Procedure for Validating Topics as Measures

Cited by 35 publications

References 51 publications

A Systematic Evaluation of Text Mining Methods for Short Texts: Mapping Individuals’ Internal States from Online Posts

A Systematic Evaluation of Text Mining Methods for Short Texts: Mapping Individuals’ Internal States from Online Posts

Twitter data from the 2019–20 Australian bushfires reveals participatory and temporal variations in social media use for disaster recovery

Researcher reasoning meets computational capacity: Machine learning for social science

Contact Info

Product

Resources

About