Untitled

Hoyle, Alexander; Wolf-Sonkin, Lawrence; Wallach, Hanna; Cotterell, Ryan; Augenstein, Isabelle

doi:10.18653/v1/n19-1065

Cited by 3 publications

(11 citation statements)

References 17 publications

“…6). For predicting the sentiment labels of documents, we choose a simple procedure following Go et al (2009); Kiritchenko et al (2014); Ozdemir and Bergler (2015); Hoyle et al (2019): for each document, we replace each token with its corresponding sentiment value from a dictionary. Then, we average all values per document and pass it to a logistic regression (LR) model that is fitted on the training set to predict document labels.…”

Section: Extrinsic Evaluation: Classification (mentioning)

confidence: 99%
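
The procedure quoted above (look up a sentiment value per token, average per document, feed the average to a logistic regression) can be sketched as follows. This is only an illustration under stated assumptions, not the cited authors' code: the toy dictionary `SENTIMENT`, the helper names, the training documents, and the hand-rolled gradient-descent fit are all hypothetical stand-ins.

```python
import math

# Hypothetical toy dictionary: token -> sentiment value (not a real lexicon).
SENTIMENT = {"great": 1.0, "good": 0.5, "okay": 0.0, "bad": -0.5, "awful": -1.0}

def doc_feature(tokens, lexicon=SENTIMENT):
    """Average the sentiment values of the tokens found in the lexicon.

    Tokens missing from the lexicon are skipped; an empty match yields 0.0
    (an assumption, the quoted text does not say how this case is handled).
    """
    values = [lexicon[t] for t in tokens if t in lexicon]
    return sum(values) / len(values) if values else 0.0

def fit_logistic(xs, ys, lr=0.5, epochs=2000):
    """Fit a one-feature logistic regression with plain gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in zip(xs, ys):
            p = 1.0 / (1.0 + math.exp(-(w * x + b)))  # predicted P(label = 1)
            grad_w += (p - y) * x
            grad_b += p - y
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b

def predict(x, w, b):
    """Return 1 (positive) if the linear score is above zero, else 0."""
    return 1 if w * x + b > 0 else 0

# Made-up training set: (tokens, label) with 1 = positive, 0 = negative.
train_docs = [
    ("great good movie".split(), 1),
    ("good okay plot".split(), 1),
    ("bad awful acting".split(), 0),
    ("awful okay ending".split(), 0),
]
xs = [doc_feature(tokens) for tokens, _ in train_docs]
ys = [label for _, label in train_docs]
w, b = fit_logistic(xs, ys)
```

In practice a library implementation of logistic regression would replace `fit_logistic`; the point of the sketch is that each document is reduced to a single averaged feature before classification.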

“…Sentiment analysis is being applied in various domains from political science (Young and Soroka, 2012; Gründl, 2020; Widmann and Wich, 2022) to economics (Stephany et al, 2022) and computational social science (West et al, 2014; Falck et al, 2020; Stoehr et al, 2021). In all of these applications, there is a strong demand for domain-specific and interpretable methods (Hofman et al, 2021; Widmann and Wich, 2022) making dictionary-based sentiment analysis still a popular choice (Young and Soroka, 2012; Hoyle et al, 2019; Gründl, 2020; Friedrichs et al, 2022).…”

Section: Introduction (mentioning)

confidence: 99%

Sentiment as an Ordinal Latent Variable

Stoehr, Cotterell, Schein. 2023. Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics

Sentiment analysis has become a central tool in various disciplines outside of natural language processing. Particularly in applied and domain-specific settings with strong requirements for interpretable methods, dictionary-based approaches are still a popular choice. However, existing dictionaries are often limited in coverage and static once annotation is completed, and their sentiment scales differ widely: some are discrete, others continuous. We propose a Bayesian generative model that learns a composite sentiment dictionary as an interpolation between six existing dictionaries with different scales. We argue that sentiment is a latent concept with intrinsically ranking-based characteristics: the word "excellent" may be ranked more positive than "great" and "okay", but it is hard to express how much more exactly. This prompts us to enforce an ordinal scale of ordered discrete sentiment values in our dictionary. We achieve this through an ordering transformation in the priors of our model. We evaluate the model intrinsically by imputing missing values in existing dictionaries. Moreover, we conduct extrinsic evaluations through sentiment classification tasks. Finally, we present two extensions. First, we present a method to augment dictionary-based approaches with word embeddings to construct sentiment scales along new semantic axes. Second, we demonstrate a Latent Dirichlet Allocation-inspired variant of our model that learns document topics that are ordered by sentiment.
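
The abstract does not spell out its ordering transformation. As a rough illustration of the general idea only, one common way to obtain ordered values from unconstrained ones is to keep the first value and add strictly positive increments. The function name `ordered_transform` and the choice of `exp` for the increments are assumptions, not details taken from the paper.

```python
import math

def ordered_transform(raw):
    """Map unconstrained reals to a strictly increasing sequence.

    The first element is kept as is; every later element adds exp(r) > 0
    to its predecessor, so the output is ordered by construction.
    """
    if not raw:
        return []
    out = [raw[0]]
    for r in raw[1:]:
        out.append(out[-1] + math.exp(r))
    return out

levels = ordered_transform([0.0, 0.0, 0.0])
```

Because every increment is positive, any real-valued input produces a valid ordering, which is what makes this kind of transformation convenient inside a prior.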

“…Following Hoyle et al (2019a), we use sentiment as a proxy to quantify bias, which requires a sentiment lexicon for each analyzed language. We use the combined sentiment lexicon of Hoyle et al (2019b) for English words, which was shown to outperform a number of individual sentiment lexica and their straight-forward combination on a text classification task involving sentiment analysis. Unfortunately, this is only available for English.…”

Section: Sentiment Data (mentioning)

confidence: 99%

“…For the remaining six languages we use SentiVAE (Hoyle et al, 2019b), a multi-view variational autoencoder, to combine existing sentiment lexica, the same method Hoyle et al (2019a) use to generate the English sentiment lexicon. Particularly, SentiVAE combines lexica with disparate scales into a common latent representation, where the output represents the strength of each word's sentiment (positive, negative and neutral) in the form of a three-dimensional Dirichlet distribution.…”

Section: Sentiment Data (mentioning)

confidence: 99%

“…For Hindi, we combine the sentiment lexica from Chen and Skiena (2014), Desai (2016) and Sharan (2016). Following Hoyle et al (2019b) we evaluate the resulting lexica in a text classification task on a selected dataset for each of the languages. Namely, we use the resulting lexica to automatically label instances with their sentiment, based on the average sentiment of words in each sentence.…”

Section: Sentiment Data (mentioning)

confidence: 99%
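
The statements above describe each word's sentiment as a three-dimensional Dirichlet distribution over (positive, negative, neutral) and a labeling step based on the average sentiment of the words in a sentence. One way this could look is sketched below. The concentration values in `LEXICON`, the use of the Dirichlet mean as the per-word score, and the argmax labeling rule are illustrative assumptions; the quoted text does not specify these details.

```python
# Hypothetical Dirichlet concentration parameters per word, ordered
# (positive, negative, neutral); the numbers are made up for illustration.
LEXICON = {
    "excellent": (8.0, 1.0, 1.0),
    "terrible": (1.0, 8.0, 1.0),
    "table": (1.0, 1.0, 8.0),
}
LABELS = ("positive", "negative", "neutral")

def dirichlet_mean(alpha):
    """Mean of a Dirichlet distribution: each parameter over the total."""
    total = sum(alpha)
    return tuple(a / total for a in alpha)

def label_sentence(tokens, lexicon=LEXICON):
    """Average the per-word Dirichlet means and return the largest class.

    Returns None when no token is covered by the lexicon.
    """
    means = [dirichlet_mean(lexicon[t]) for t in tokens if t in lexicon]
    if not means:
        return None
    avg = [sum(m[i] for m in means) / len(means) for i in range(len(LABELS))]
    return LABELS[avg.index(max(avg))]
```

For example, `label_sentence(["an", "excellent", "meal"])` uses only the one covered word, `"excellent"`, and so returns the class with the highest mean for that word.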

Quantifying Gender Bias Towards Politicians in Cross-Lingual Language Models

Stańczak, Choudhury, Pimentel et al. 2021. Preprint. Self-citation.

While the prevalence of large pre-trained language models has led to significant improvements in the performance of NLP systems, recent research has demonstrated that these models inherit societal biases extant in natural language. In this paper, we explore a simple method to probe pre-trained language models for gender bias, which we use to effect a multi-lingual study of gender bias towards politicians. We construct a dataset of 250k politicians from most countries in the world and quantify adjective and verb usage around those politicians' names as a function of their gender. We conduct our study in 7 languages across 6 different language modeling architectures. Our results demonstrate that stance towards politicians in pre-trained language models is highly dependent on the language used. Finally, contrary to previous findings, our study suggests that larger language models do not tend to be significantly more gender-biased than smaller ones.

Revered and reviled: a sentiment analysis of female and male referents in three languages

Levshina, Koptjevskaja-Tamm, Östling. 2024. Front. Commun.

Our study contributes to the less explored domain of lexical typology, focusing on semantic prosody and connotation. Semantic derogation, or pejoration of nouns referring to women, whereby such words acquire connotations and further denotations of social pejoration, immorality and/or loose sexuality, has been a very prominent question in studies on gender and language (change). It has been argued that pejoration emerges due to the general derogatory attitudes toward female referents. However, the evidence for systematic differences in connotations of female- vs. male-related words is fragmentary and often fairly impressionistic; moreover, many researchers argue that expressed sentiments toward women (as well as men) often are ambivalent. One should also expect gender differences in connotations to have decreased in recent years, thanks to the advances of feminism and social progress. We test these ideas in a study of positive and negative connotations of feminine and masculine term pairs such as woman - man, girl - boy, wife - husband, etc. Sentences containing these words were sampled from diachronic corpora of English, Chinese and Russian, and sentiment scores for every word were obtained using two systems for Aspect-Based Sentiment Analysis: PyABSA and OpenAI’s large language model GPT-3.5. The Generalized Linear Mixed Models of our data provide no indications of significantly more negative sentiment toward female referents in comparison with their male counterparts. However, some of the models suggest that female referents are less frequently associated with neutral sentiment than male ones. Neither do our data support the hypothesis of diachronic convergence between the genders. In sum, results suggest that pejoration is unlikely to be explained simply by negative attitudes to female referents in general.
