“…Current dominant IE systems consider mention-level scoring of NER as well as RE components when reporting on datasets such as CoNLL-2003(Akbik et al, 2019Baevski et al, 2019;Chiu & Nichols, 2016;Lample et al, 2016), OntoNotes (Chiu & Nichols, 2016;Clark et al, 2018;Strubell et al, 2017), ACE 2004 (Bekoulis et al, 2018a;Li & Ji, 2014;Zhang et al, 2017a), ACE 2005(Fei et al, 2020Luan et al, 2019;Zhang et al, 2017a), TACRED (Soares et al, 2019;Zhang et al, 2018Zhang et al, , 2017b, and SelEval 2010-Task 8 (Guo et al, 2019;Hu et al, 2020;Peters et al, 2019) among others. In contrast, the DWIE dataset is entity-centric where all the annotations are done on the entity cluster level.…”