Dictionaries and distributions: Combining expert knowledge and large scale textual data content analysis

Garten, Justin; Hoover, Joe; Johnson, Kate; Boghrati, Reihane; Iskiwitch, Carol; Dehghani, Morteza

doi:10.3758/s13428-017-0875-9

Cited by 133 publications

(171 citation statements)

References 50 publications

(44 reference statements)

Supporting

Mentioning

150

Contrasting

Unclassified

Order By: Relevance

“…More recently, Garten et al [36] employed the MFD to detect moral rhetoric in general, and more specifically, shifts in long political speeches over time. Then, based on psychological dictionaries and semantic similarity to quantify the presence of moral sentiment around a given topic, Garten et al [37], proposed the Distributed Dictionary Representations (DDR) method. Showing promising results, DDR was also employed by Hoover et al [38] to detect moral values in charitable giving.…”

Section: Related Literaturementioning

confidence: 99%

MoralStrength: Exploiting a moral lexicon and embedding similarity for moral foundations prediction

Araque

Gatti

Kalimeri

2020

Knowledge-Based Systems

View full text Add to dashboard Cite

Moral rhetoric plays a fundamental role in how we perceive and interpret the information we receive, greatly influencing our decision-making process. Especially when it comes to controversial social and political issues, our opinions and attitudes are hardly ever based on evidence alone. The Moral Foundations Dictionary (MFD) was developed to operationalize moral values in the text. In this study, we present MoralStrength, a lexicon of approximately 1,000 lemmas, obtained as an extension of the Moral Foundations Dictionary, based on Word-Net synsets. Moreover, for each lemma it provides with a crowdsourced numeric assessment of Moral Valence, indicating the strength with which a lemma is expressing the specific value. We evaluated the predictive potentials of this moral lexicon, defining three utilization approaches of increased complexity, ranging from lemmas' statistical properties to a deep learning approach of word embeddings based on semantic similarity. Logistic regression models trained on the features extracted from MoralStrength, significantly outperformed the current state-of-the-art, reaching an F1-score of 87.6% over the previous 62.4% (p-value< 0.01), and an average F1-Score of 86.25% over six different datasets. Such findings pave the way for further research, allowing for an in-depth understanding of moral narratives in text for a wide range of social issues.

show abstract

Section: Related Literaturementioning

confidence: 99%

MoralStrength: Exploiting a moral lexicon and embedding similarity for moral foundations prediction

Araque

Gatti

Kalimeri

2020

Knowledge-Based Systems

View full text Add to dashboard Cite

show abstract

“…Beyond focusing on an under-explored domain, this research demonstrates how cutting-edge, data-driven Natural Language Processing (NLP) methods can be used with theoretical constraints and integrated into an experimental research paradigm. In Study 1, we estimate the semantic association between charitable donation sentiment and moral values using DDR (Garten et al, 2017), an NLP framework that uses distributed representations (Le & Mikolov, 2014;Mikolov, Yih, & Zweig, 2013) learned by a neural network to measure the presence of latent semantic constructs in short texts. Specifically, we rely on DDR to model the association between expressions of moral values and language associated with charitable donation in a corpus of tweets posted during and after Hurricane Sandy.…”

Section: Current Workmentioning

confidence: 99%

“…This approach pairs a theoretically constrained exploratory social media study with subsequent confirmatory experimental studies. To generate exploratory hypotheses, we estimate a set of hierarchical linear models using measurements obtained via a recently developed Natural Language Processing algorithm, Distributed Dictionary Representation (DDR; Garten et al, 2017), that harnesses the power of data-driven language modeling but also offers the precision of theory-driven measurement specificity. We then programmatically test these hypotheses with a series of preregistered, confirmatory experiments.…”

mentioning

confidence: 99%

Moral Framing and Charitable Donation: Integrating Exploratory Social Media Analyses and Confirmatory Experimentation

Hoover

Johnson

Boghrati

et al. 2018

Collabra: Psychology

Self Cite

View full text Add to dashboard Cite

Do appeals to moral values promote charitable donation during natural disasters? Using Distributed Dictionary Representation, we analyze tweets posted during Hurricane Sandy to explore associations between moral values and charitable donation sentiment. We then derive hypotheses from the observed associations and test these hypotheses across a series of preregistered experiments that investigate the effects of moral framing on perceived donation motivation (Studies 2 & 3), hypothetical donation (Study 4), and real donation behavior (Study 5). Overall, we find consistent positive associations between moral care and loyalty framing with donation sentiment and donation motivation. However, in contrast with people's perceptions, we also find that moral frames may not actually have reliable effects on charitable donation, as measured by hypothetical indications of donation and real donation behavior. Overall, this work demonstrates that theoretically constrained, exploratory social media analyses can be used to generate viable hypotheses, but also that such approaches should be paired with rigorous controlled experiments.

show abstract

“…The word-level embeddings are Word2Vec (Mikolov et al, 2013) embeddings learned from the age 50 essay training set; words that appeared less than ten times were replaced with an out-of-vocabulary token. This approach is similar to that of Garten et al (2017), which uses embeddings to capture semantic similarity when applying psychological lexica. It's also similar in motivation to metrics like TERp (Snover et al, 2009) and ME-TEOR (Denkowski and Lavie, 2014) which leverage semantic similarity for evaluating language generation.…”

Section: Innovation Challengementioning

confidence: 99%

CLPsych 2018 Shared Task: Predicting Current and Future Psychological Health from Childhood Essays

Lynn¹,

Goodman²,

Niederhoffer³

et al. 2018

Proceedings of the Fifth Workshop on Computational Linguistics And Clinical Psychology: From Keyboard to Clinic

View full text Add to dashboard Cite

We describe the shared task for the CLPsych 2018 workshop, which focused on predicting current and future psychological health from an essay authored in childhood. Language-based predictions of a person's current health have the potential to supplement traditional psychological assessment such as questionnaires, improving intake risk measurement and monitoring. Predictions of future psychological health can aid with both early detection and the development of preventative care. Research into the mental health trajectory of people, beginning from their childhood, has thus far been an area of little work within the NLP community. This shared task represents one of the first attempts to evaluate the use of early language to predict future health; this has the potential to support a wide variety of clinical health care tasks, from early assessment of lifetime risk for mental health problems, to optimal timing for targeted interventions aimed at both prevention and treatment.

show abstract

Dictionaries and distributions: Combining expert knowledge and large scale textual data content analysis

Cited by 133 publications

References 50 publications

MoralStrength: Exploiting a moral lexicon and embedding similarity for moral foundations prediction

MoralStrength: Exploiting a moral lexicon and embedding similarity for moral foundations prediction

Moral Framing and Charitable Donation: Integrating Exploratory Social Media Analyses and Confirmatory Experimentation

CLPsych 2018 Shared Task: Predicting Current and Future Psychological Health from Childhood Essays

Contact Info

Product

Resources

About