“…Several hate speech datasets are publicly available, e.g., for English (Waseem and Hovy, 2016;Davidson et al, 2017;Nobata et al, 2016;Jigsaw, 2018), Spanish (Fersini et al, 2018), Italian (Poletto et al, 2017;Sanguinetti et al, 2018), German (Ross et al, 2016), Hindi (Kumar et al, 2018), and Portuguese (de Pelle and Moreira, 2017). In this section, we analyze the data collection strategy, the annotation method and the dataset properties of three representative hate speech datasets: the Hate speech, Racism and Sexism dataset by Waseem and Hovy (2016), the Offensive Language Dataset by Davidson et al (2017), and the Portuguese News Comments dataset by de Pelle and Moreira (2017).…”