Proceedings of the Third Workshop on Abusive Language Online 2019
DOI: 10.18653/v1/w19-3502
Exploring Deep Multimodal Fusion of Text and Photo for Hate Speech Classification

Abstract: Interactions among users on social network platforms are usually positive, constructive and insightful. However, people are also sometimes exposed to objectionable content such as hate speech, bullying, and verbal abuse. Most social platforms have an explicit policy against hate speech because it creates an environment of intimidation and exclusion, and in some cases may promote real-world violence. As users' interactions on today's social networks involve multiple modalities, such as texts, images and videos…
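The abstract describes fusing text and photo signals for hate speech classification. As a rough illustration of the general idea (not the paper's actual architecture), a minimal late-fusion sketch concatenates a text feature vector and an image feature vector and applies a linear scorer; the feature dimensions, weights, and extractor choices below are illustrative placeholders.

```python
import numpy as np

rng = np.random.default_rng(0)

def fuse_and_score(text_feat, image_feat, W, b):
    """Concatenate modality features, then apply a linear scorer + sigmoid."""
    fused = np.concatenate([text_feat, image_feat])  # simple concatenation fusion
    logit = W @ fused + b
    return 1.0 / (1.0 + np.exp(-logit))              # probability of "hateful"

# Hypothetical pre-extracted features (e.g. averaged word embeddings
# and pooled CNN activations); dimensions are arbitrary for this sketch.
text_feat = rng.standard_normal(300)
image_feat = rng.standard_normal(512)
W = rng.standard_normal(812) * 0.01   # untrained placeholder weights
b = 0.0

p = fuse_and_score(text_feat, image_feat, W, b)
print(p)  # a value in [0, 1]
```

In practice the fused representation would feed a trained classifier head; concatenation is only one of several fusion strategies the literature discusses.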

Cited by 70 publications (38 citation statements)
References 25 publications
“…Indeed, the community is aware of this form of abuse, and there have been several attempts at multimodal analysis (Singh et al., 2017; Yang et al., 2019; Gomez et al., 2020). In our work, however, we do not address the aspect of multimodal abuse, simply because many datasets include only the textual component of a micropost, and the non-textual components of posts can be reconstructed only with greater effort or not obtained at all.…”
Section: Multimodal Abuse (mentioning)
confidence: 99%
“…A large portion of the hateful content shared on social media is in the form of memes, which can feature multiple modalities such as text, images, and in some cases audio and video. [54] present different fusion approaches to tackle multi-modal information for hate speech detection. [11] explore multi-modal hate speech consisting of text and image modalities.…”
Section: Related Work (mentioning)
confidence: 99%
“…Most recent works have focused on leveraging neural networks in this task (Chen et al., 2015; Nguyen and Grishman, 2015; Nguyen et al., 2016; Ghaeini et al., 2016; Feng et al., 2016). The existing approaches can be categorized into two classes: the first improves ED through special learning techniques, including adversarial training (Hong et al., 2018), knowledge distillation (Liu et al., 2019), and model pretraining (Yang et al., 2019). The second improves ED by introducing extra resources, such as argument information, document information (Duan et al., 2017; Chen et al., 2018), multi-lingual information (Liu et al., 2018a, 2019), knowledge bases, and syntactic information (Sha et al., 2018).…”
Section: Related Work (mentioning)
confidence: 99%