Paolo Rosso scite author profile

The paper describes the organization of the SemEval 2019 Task 5 about the detection of hate speech against immigrants and women in Spanish and English messages extracted from Twitter. The task is organized in two related classification subtasks: a main binary subtask for detecting the presence of hate speech, and a finer-grained one devoted to identifying further features in hateful contents such as the aggressive attitude and the target harassed, to distinguish if the incitement is against an individual rather than a group. HatEval has been one of the most popular tasks in SemEval-2019 with a total of 108 submitted runs for Subtask A and 70 runs for Subtask B, from a total of 74 different teams. Data provided for the task are described by showing how they have been collected and annotated. Moreover, the paper provides an analysis and discussion about the participant systems and the results they achieved in both subtasks.

show abstract

A multidimensional approach for detecting irony in Twitter

Reyes

Rosso

Veale

2012

Lang Resources & Evaluation

341

249

View full text Add to dashboard Cite

Overview of the Evalita 2018 Task on Automatic Misogyny Identification (AMI)

Fersini¹,

Nozza²,

Rosso³

2018

159

192

View full text Add to dashboard Cite

From humor recognition to irony detection: The figurative language of social media

Reyes

Rosso

Buscaldi

2012

Data & Knowledge Engineering

271

164

View full text Add to dashboard Cite

ElsevierReyes Pérez, A.; Rosso, P.; Buscaldi, D. (2012) AbstractThe research described in this paper focuses on analyzing two playful domains of language: humor and irony, in order to identify key values components for their automatic processing. In particular, we focus on describing a model for recognizing these phenomena in social media, such as "tweets". Our experiments are centered on five data sets retrieved from Twitter taking advantage of usergenerated tags, such as "#humor" and "#irony". The model, which is based on textual features, is assessed on two dimensions: representativeness and relevance. The results, apart from providing some valuable insights into the creative and figurative usages of language, are positive regarding humor, and encouraging regarding irony.

show abstract

SemEval-2015 Task 11: Sentiment Analysis of Figurative Language in Twitter

Ghosh¹,

Li²,

Veale³

et al. 2015

151

127

View full text Add to dashboard Cite

This report summarizes the objectives and evaluation of the SemEval 2015 task on the sentiment analysis of figurative language on Twitter (Task 11). This is the first sentiment analysis task wholly dedicated to analyzing figurative language on Twitter. Specifically, three broad classes of figurative language are considered: irony, sarcasm and metaphor. Gold standard sets of 8000 training tweets and 4000 test tweets were annotated using workers on the crowdsourcing platform CrowdFlower. Participating systems were required to provide a fine-grained sentiment score on an 11-point scale (-5 to +5, including 0 for neutral intent) for each tweet, and systems were evaluated against the gold standard using both a Cosinesimilarity and a Mean-Squared-Error measure.

show abstract

Convolutional Neural Networks for Authorship Attribution of Short Texts

et al. 2017

View full text Add to dashboard Cite

show abstract

Wikipedia Vandalism Detection: Combining Natural Language, Metadata, and Reputation Features

et al. 2011

View full text Add to dashboard Cite

Wikipedia is an online encyclopedia which anyone can edit. While most edits are constructive, about 7% are acts of vandalism. Such behavior is characterized by modifications made in bad faith; introducing spam and other inappropriate content. In this work, we present the results of an effort to integrate three of the leading approaches to Wikipedia vandalism detection: a spatiotemporal analysis of metadata (STiki), a reputation-based system (Wiki-Trust), and natural language processing features. The performance of the resulting joint system improves the state-of-the-art from all previous methods and establishes a new baseline for Wikipedia vandalism detection. We examine in detail the contribution of the three approaches, both for the task of discovering fresh vandalism, and for the task of locating vandalism in the complete set of Wikipedia revisions. Authors appear alphabetically. Order does not reflect contribution magnitude.

show abstract

Automatic Identification and Classification of Misogynistic Language on Twitter

Anzovino

Fersini

Rosso

2018

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Paolo Rosso

SemEval-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter

A multidimensional approach for detecting irony in Twitter

Overview of the Evalita 2018 Task on Automatic Misogyny Identification (AMI)

From humor recognition to irony detection: The figurative language of social media

SemEval-2015 Task 11: Sentiment Analysis of Figurative Language in Twitter

Convolutional Neural Networks for Authorship Attribution of Short Texts

Wikipedia Vandalism Detection: Combining Natural Language, Metadata, and Reputation Features

Automatic Identification and Classification of Misogynistic Language on Twitter

Contact Info

Product

Resources

About