2021
DOI: 10.1017/pan.2021.37
Cross-Domain Topic Classification for Political Texts

Abstract: We introduce and assess the use of supervised learning in cross-domain topic classification. In this approach, an algorithm learns to classify topics in a labeled source corpus and then extrapolates topics in an unlabeled target corpus from another domain. The ability to use existing training data makes this method significantly more efficient than within-domain supervised learning. It also has three advantages over unsupervised topic models: the method can be more specifically targeted to a research question …
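To make the abstract's setup concrete, the sketch below shows one way a cross-domain topic classifier could be wired up: fit a supervised model on a labeled source corpus, then extrapolate topic labels (or class probabilities) to an unlabeled target corpus from another domain. The corpora, features, and model choices here are illustrative placeholders, not the authors' exact pipeline.

```python
# A minimal sketch of cross-domain topic classification, assuming a labeled
# source corpus and an unlabeled target corpus from a different domain.
# All documents, labels, and pipeline settings below are invented examples.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Hypothetical source corpus: documents with human-coded topic labels.
source_texts = ["we will expand public health insurance", "cut corporate taxes now"]
source_topics = ["health", "economy"]

# Hypothetical target corpus from another domain (no labels available).
target_texts = ["the member asked about hospital waiting lists"]

# Learn a mapping from word features to topics on the source domain ...
clf = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)),
                    LogisticRegression(max_iter=1000))
clf.fit(source_texts, source_topics)

# ... and extrapolate topic labels and class probabilities to the target domain.
predicted_topics = clf.predict(target_texts)
topic_probabilities = clf.predict_proba(target_texts)
```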

Cited by 23 publications (35 citation statements); references 36 publications (54 reference statements).
“…Using the text classification method, we can automate many types of analyses in political science. As listed in the examples in Figure 2, researchers can detect the political perspective of news articles (Huguet Cabot et al., 2020), the stance in media on a certain topic (Luo et al., 2020), whether campaigns use positive or negative sentiment (Ansolabehere and Iyengar, 1995), which issue area the legislation is about (Adler and Wilkerson, 2011), topics in parliamentary speech (Albaugh et al., 2013; Osnabrügge et al., 2021), congressional bills (Hillard et al., 2008; Collingwood and Wilkerson, 2012) and the political agenda (Karan et al., 2016), whether an international statement is peaceful or belligerent (Schrodt, 2000), whether a speech contains positive or negative sentiment (Schumacher et al., 2016), and whether a U.S. Circuit Courts case decision is conservative or liberal (Hausladen et al., 2020).…”
Section: NLP for Text Analysis (mentioning)
confidence: 99%
“…Researchers often adopt a weighting scheme, called term frequency-inverse document frequency (TF-IDF), that gives more weight to less frequent words. The main advantage of dictionary methods is the ease of interpretation, while the main disadvantage is low design efficiency: before conducting any analysis, researchers must spend a significant amount of time designing a classification scheme by compiling an exhaustive list of keywords that belong to each category (Osnabrügge et al., 2021). A second method for automated text analysis is probabilistic topic modeling.…”
Section: Independent Variables: IMF Program Participation and Conditi… (mentioning)
confidence: 99%
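The excerpt above mentions two building blocks: TF-IDF weighting, which down-weights words that appear in many documents, and dictionary classification based on hand-compiled keyword lists. The sketch below illustrates both with invented documents and keyword lists; it is not drawn from any of the cited papers.

```python
# Minimal sketch: TF-IDF weighting and a simple dictionary classifier.
# Documents and keyword lists are illustrative placeholders.
from sklearn.feature_extraction.text import TfidfVectorizer

docs = [
    "the bill raises the minimum wage",
    "the bill funds hospitals and public health",
    "the committee debated the bill",
]

# TF-IDF: term frequency scaled by inverse document frequency, so "bill"
# (which appears in every document) carries less weight than "wage" or "hospitals".
tfidf = TfidfVectorizer()
weights = tfidf.fit_transform(docs)  # sparse document-term matrix of TF-IDF weights

# Dictionary method: hand-compiled keyword lists, one per category; a document
# gets the category whose keywords it matches most often (ties fall to the
# first category listed).
dictionary = {
    "economy": {"wage", "tax", "budget"},
    "health": {"hospitals", "health", "insurance"},
}

def classify(doc):
    tokens = set(doc.lower().split())
    return max(dictionary, key=lambda cat: len(dictionary[cat] & tokens))

labels = [classify(d) for d in docs]
```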
“…A topic is a distribution over a fixed vocabulary (Blei, 2012); for example, the topic natural resources has a fixed vocabulary that includes words like oil, mining, and hydrocarbon. Topic models have high design efficiency (Osnabrügge et al., 2021), because they do not require training sets and are suitable for new discoveries: they can parse the data to identify hidden patterns that are not immediately evident to the human eye (like the unobservable influence of IMF conditionality on domestic legislation).…”
Section: Data and Descriptive Analysis (mentioning)
confidence: 99%
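Since the excerpt above describes a topic as a distribution over a fixed vocabulary learned without training labels, a small unsupervised sketch may help. The example below fits an LDA topic model with scikit-learn on an invented four-document corpus; the corpus and the choice of two topics are illustrative assumptions.

```python
# Minimal sketch of an unsupervised topic model (LDA): no labeled training
# set is needed, and each fitted topic is a weight vector over the vocabulary.
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer

docs = [
    "oil and mining royalties fund the budget",
    "hydrocarbon exports and oil prices",
    "hospital funding and public health insurance",
    "health insurance reform for hospitals",
]

vectorizer = CountVectorizer()
counts = vectorizer.fit_transform(docs)

lda = LatentDirichletAllocation(n_components=2, random_state=0)
doc_topic = lda.fit_transform(counts)  # per-document topic shares

# Each row of components_ is (proportional to) a topic's distribution over the
# vocabulary; the highest-weight words characterize the topic.
vocab = vectorizer.get_feature_names_out()
top_words = [
    [vocab[i] for i in topic.argsort()[::-1][:3]]
    for topic in lda.components_
]
```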
“…We are interested in forming a predicted probability of the source of a document for scoring influence in a second corpus. Other related methods are Peterson and Spirling (2018) and Osnabrügge et al. (2021). We have fewer snippets from FNC than from CNN/MSNBC. Thus, we randomly under-sample the snippets from the CNN/MSNBC corpus to match the number of snippets from FNC. Previous work has shown that supervised learning models using n-grams are rarely sensitive to the specific choices in pre-processing and featurization (e.g., Denny and Spirling, 2018).…”
(mentioning)
confidence: 99%
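The source-prediction idea in the excerpt above can be sketched as follows, assuming two unequal collections of snippets, under-sampling of the larger one, and a classifier whose predicted probabilities are then used to score documents in a second corpus. The placeholder snippets and the TF-IDF-plus-logistic-regression pipeline are illustrative assumptions, not the cited paper's exact implementation.

```python
# Minimal sketch: under-sample the larger corpus, train a source classifier,
# then score a second corpus by predicted probability of one source.
# All snippets below are placeholders.
import random

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

fnc_snippets = ["placeholder transcript snippet one", "placeholder transcript snippet two"]
cnn_msnbc_snippets = [
    "placeholder transcript snippet three",
    "placeholder transcript snippet four",
    "placeholder transcript snippet five",
]

# Under-sample the larger corpus so the two classes are balanced.
random.seed(0)
cnn_msnbc_sample = random.sample(cnn_msnbc_snippets, k=len(fnc_snippets))

texts = fnc_snippets + cnn_msnbc_sample
labels = [1] * len(fnc_snippets) + [0] * len(cnn_msnbc_sample)  # 1 = FNC

model = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)),
                      LogisticRegression(max_iter=1000))
model.fit(texts, labels)

# Score a second corpus: predicted probability that each document is FNC-like.
second_corpus = ["placeholder snippet from a later corpus"]
fnc_scores = model.predict_proba(second_corpus)[:, 1]
```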