“…69 Examples of setups and applications are (but not limited to) social media, 70 healthcare, [71][72][73] information retrieval, 74 sentiment analysis, [75][76][77][78][79] content-based recommender systems, 80 document summarization, 81,82 various business and marketing applications, [83][84][85] and legal document categorization. 86 A variety of languages were targeted over time for the popular text classification task, including well-studied languages, such as Arabic, 87,88 Turkish, 83,[89][90][91] French, 71,92 Spanish, 72 and Indian, 93 as well as underresourced languages, such as Romanian. 94 The applied classification techniques range from shallow methods, such as Logistic Regression, 95 SVM, 96 and Naïve Bayes, 97 to more complex and resource-hungry deep neural networks, such as CNNs, 62,98 Hierarchical Attention Networks (HANs), 99 and the powerful transformer-based methods that started to dominate the landscape in recent years.…”