Hamza Alami scite author profile

AraBERT is an Arabic version of the state-of-the-art Bidirectional Encoder Representations from Transformers (BERT) model. The latter has achieved good performance in a variety of Natural Language Processing (NLP) tasks. In this paper, we propose an effective AraBERT embeddingsbased method for dealing with offensive Arabic language in Twitter. First, we pre-process tweets by handling emojis and including their Arabic meanings. Next, to overcome the pretrain-finetune discrepancy, we substitute each detected emojis by the special token [MASK] into both fine tuning and inference phases. Then, we represent tweets tokens by applying AraBERT model. Finally, we feed the tweet representation into a sigmoid function to decide whether a tweet is offensive or not. The proposed method achieved the best results on OffensEval 2020: Arabic task and reached a macro F1 score equal to 90.17%.

show abstract

Towards a Passages Extraction Method for Arabic Question Answering Systems

Lahbari

Alami

Zidani

et al. 2020

View full text Add to dashboard Cite

Exploring Contextual word representation for Arabic question classification

Alami

En-Nahnahi²,

Ouatik

2020

View full text Add to dashboard Cite

Arabic duplicate questions detection based on contextual representation, class label matching, and structured self attention

Alami

Ouatik

Zidani

et al. 2022

Journal of King Saud University - Computer and Information Scie

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Hamza Alami

An arabic question classification method based on new taxonomy and continuous distributed representation of words

LISAC FSDM-USMBA Team at SemEval-2020 Task 12: Overcoming AraBERT’s pretrain-finetune discrepancy for Arabic offensive language identification

Towards a Passages Extraction Method for Arabic Question Answering Systems

Exploring Contextual word representation for Arabic question classification

Arabic duplicate questions detection based on contextual representation, class label matching, and structured self attention

Contact Info

Product

Resources

About