π-Excess σ2P ligands: synthesis of biaryl-type 1,3-benzazaphosphole hybrid ligands and formation of P^P′–M(CO)4 chelate complexes

Current research on hate speech analysis is typically oriented towards monolingual and single classification tasks. In this paper, we present a new multilingual multi-aspect hate speech analysis dataset and use it to test the current state-of-the-art multilingual multitask learning approaches. We evaluate our dataset in various classification settings, then we discuss how to leverage our annotations in order to improve hate speech detection and classification in general. 4 https://competitions.codalab.org/competitions/19935

show abstract

Comparative Evaluation of Label-Agnostic Selection Bias in Multilingual Hate Speech Datasets

Ousidhoum¹,

Song²,

Yeung³

2020

View full text Add to dashboard Cite

Work on bias in hate speech typically aims to improve classification performance while relatively overlooking the quality of the data. We examine selection bias in hate speech in a language and label independent fashion. We first use topic models to discover latent semantics in eleven hate speech corpora, then, we present two bias evaluation metrics based on the semantic similarity between topics and search words frequently used to build corpora. We discuss the possibility of revising the data collection process by comparing datasets and analyzing contrastive case studies.

show abstract

Probing Toxic Content in Large Pre-Trained Language Models

Ousidhoum¹,

Zhao²,

Fang³

et al. 2021

View full text Add to dashboard Cite

Large pre-trained language models (PTLMs) have been shown to carry biases towards different social groups which leads to the reproduction of stereotypical and toxic content by major NLP systems. We propose a method based on logistic regression classifiers to probe English, French, and Arabic PTLMs and quantify the potentially harmful content that they convey with respect to a set of templates. The templates are prompted by a name of a social group followed by a cause-effect relation. We use PTLMs to predict masked tokens at the end of a sentence in order to examine how likely they enable toxicity towards specific communities. We shed the light on how such negative content can be triggered within unrelated and benign contexts based on evidence from a large-scale study, then we explain how to take advantage of our methodology to assess and mitigate the toxicity transmitted by PTLMs.

show abstract

Multilingual and Multi-Aspect Hate Speech Analysis

Ousidhoum¹,

Lin²,

Zhang³

et al. 2019

Preprint

View full text Add to dashboard Cite

On the importance and challenges of the experimental design of multilingual toxic content detection

Ousidhoum¹

View full text Add to dashboard Cite

Towards the Refinement of the Arabic Soundex

Ousidhoum

Bensaou

2013

View full text Add to dashboard Cite

SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)

Muhammad¹,

Abdulmumin²,

Yimam³

et al. 2023

View full text Add to dashboard Cite

Varifocal Question Generation for Fact-checking

Ousidhoum¹,

Yuan²,

Vlachos³

2022

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Nedjma Ousidhoum

Multilingual and Multi-Aspect Hate Speech Analysis

Comparative Evaluation of Label-Agnostic Selection Bias in Multilingual Hate Speech Datasets

Probing Toxic Content in Large Pre-Trained Language Models

Multilingual and Multi-Aspect Hate Speech Analysis

On the importance and challenges of the experimental design of multilingual toxic content detection

Towards the Refinement of the Arabic Soundex

SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)

Varifocal Question Generation for Fact-checking

Contact Info

Product

Resources

About