“…Corpus and computational linguists across the world have been working in the past three decades on building Arabic corpora (Abbas & Smaili, 2005;Al-Sulaiti & Atwell, 2006;Brierley & El-Farahaty, 2019;El-Farahaty & Elewa, 2020;El-Haj & Koulali, 2013;Goweder & De Roeck, 2001); databases (Boudelaa & Marslen-Wilson, 2010;Khwaileh et al, 2018); online interfaces (Dukes & Atwell, 2012;Sharoff, 2006) and developing NLP tools (Habash, 2010;Al-Jawfi, 2009, among others). Although they are constantly increasing (Alfaifi & Atwell, 2016), Arabic is understudied by corpus-based methodologies compared to its demographic and societal relevance (McEnery et al, 2019:1).…”