Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
DOI: 10.18653/v1/n18-1021
Automated Essay Scoring in the Presence of Biased Ratings

Abstract: Studies in the social sciences have revealed that when people evaluate someone else, their evaluations often reflect their biases. As a result, rater bias may introduce highly subjective factors that make evaluations inaccurate. This may affect automated essay scoring models in many ways, as these models are typically designed to model (potentially biased) essay raters. While there is a sizeable literature on rater effects in general settings, it remains unknown how rater bias affects automated essay scoring. …

Cited by 56 publications (48 citation statements)
References: 30 publications
“…The results prove that BERT expresses strong preferences for male pronouns, raising concerns with using BERT in downstream tasks like resume filtering. [Table 5: Percentage of attributes associated more strongly with the male gender] NLP applications ranging from core tasks such as coreference resolution (Rudinger et al., 2018) and language identification (Jurgens et al., 2017), to downstream systems such as automated essay scoring (Amorim et al., 2018), exhibit inherent social biases which are attributed to the datasets used to train the embeddings (Barocas and Selbst, 2016; Zhao et al., 2017; Yao and Huang, 2017).…”
Section: Real World Implications (mentioning)
confidence: 99%
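A minimal sketch of how such a pronoun preference could be probed with a masked language model, using the Hugging Face transformers fill-mask pipeline; the template sentence and the attribute word "programmer" are illustrative assumptions, not the probes used in the cited study.

```python
# Sketch: probe a masked LM for gendered pronoun preference.
# The template and attribute word are illustrative assumptions,
# not the exact probes used in the cited study.
from transformers import pipeline

unmasker = pipeline("fill-mask", model="bert-base-uncased")

# Restrict predictions to the two pronouns and compare their scores.
for pred in unmasker("[MASK] is a programmer.", targets=["he", "she"]):
    print(pred["token_str"], round(pred["score"], 4))

# If "he" consistently outscores "she" across many such templates,
# the model associates the attribute more strongly with the male gender.
```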
“…Amorim et al. [4] raise awareness about readers' biases. To examine the extent to which these biases affect the ratings, and thus the algorithms built on those ratings, they investigate a corpus of scored texts accompanied by rater comments.…”
Section: Related Work (mentioning)
confidence: 99%
“…This methodology is used in several related works [Amorim et al. 2018; Sales et al. 2019; Moraes et al. 2016; Jha et al. 2016]. Amorim et al. [Amorim et al. 2018], for example, use a lexicon-based approach to computing subjectivity in order to evaluate comments written by raters of essays from ENEM (Exame Nacional do Ensino Médio). To assess subjectivity, the researcher uses lexicons of argumentation, sentiment, presupposition, modalization, and valuation.…”
Section: Related Work (unclassified)
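As a concrete illustration of the lexicon-based subjectivity calculation described in this excerpt, here is a minimal sketch that counts lexicon hits per aspect in a rater comment; the lexicon entries are invented placeholders, since the actual lexicons were hand-built by Amorim et al. (2018).

```python
# Sketch: lexicon-based subjectivity counts for a rater comment.
# Lexicon entries below are invented placeholders; the real lexicons
# (argumentation, sentiment, presupposition, modalization, valuation)
# were constructed manually by Amorim et al. (2018).
import re

LEXICONS = {
    "argumentation": {"therefore", "because", "however"},
    "modalization": {"perhaps", "must", "might"},
    "valuation": {"excellent", "poor", "weak"},
}

def subjectivity_counts(comment: str) -> dict:
    """Count hits from each aspect lexicon in the comment."""
    tokens = re.findall(r"\w+", comment.lower())
    return {aspect: sum(t in lex for t in tokens)
            for aspect, lex in LEXICONS.items()}

print(subjectivity_counts("The essay is weak because it lacks a thesis."))
# {'argumentation': 1, 'modalization': 0, 'valuation': 1}
```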
“…In earlier work, Sales et al. [Sales et al. 2019] proposed using subjectivity lexicons to measure, by means of word embeddings, the subjectivity of journalistic texts. These lexicons were built by [Amorim et al. 2018] through manual analysis of expressions that frequently appear in texts when the speaker seems to be expressing some subjectivity. Each lexicon encapsulates one aspect of subjectivity; specifically, these aspects are:…”
Section: Subjectivity (unclassified)
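A minimal sketch of the embedding-based variant described here: score a text by the cosine similarity between its mean word vector and the centroid of a subjectivity lexicon. The vectors below are random placeholders standing in for the pre-trained word embeddings used by Sales et al. (2019).

```python
# Sketch: subjectivity as cosine similarity between a text's mean
# word vector and a lexicon centroid. Embeddings here are random
# placeholders for pre-trained word embeddings.
import numpy as np

rng = np.random.default_rng(0)
vocab = ["perhaps", "must", "clearly", "report", "city", "budget"]
emb = {w: rng.normal(size=50) for w in vocab}  # placeholder vectors

def mean_vec(words):
    vecs = [emb[w] for w in words if w in emb]
    return np.mean(vecs, axis=0)

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Centroid of a hypothetical modalization lexicon.
lexicon_centroid = mean_vec(["perhaps", "must", "clearly"])
text = "the city budget report must clearly improve".split()
print(round(cosine(mean_vec(text), lexicon_centroid), 3))
```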