2022
DOI: 10.1038/s41598-022-09381-9
|View full text |Cite
|
Sign up to set email alerts
|

Multi-class sentiment analysis of urdu text using multilingual BERT

Abstract: Sentiment analysis (SA) is an important task because of its vital role in analyzing people’s opinions. However, existing research is solely based on the English language with limited work on low-resource languages. This study introduced a new multi-class Urdu dataset based on user reviews for sentiment analysis. This dataset is gathered from various domains such as food and beverages, movies and plays, software and apps, politics, and sports. Our proposed dataset contains 9312 reviews manually annotated by hum… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
18
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 53 publications
(44 citation statements)
references
References 52 publications
2
18
0
Order By: Relevance
“…Model A, is a rule based machine learning model for Urdu sentiment analysis using support vector machine, Naïve Bayesian, Adabbost, MLP, LR and RF along with deep learning model using CNN-1D, LSTM, Bi-LSTM, GRU and Bi-GRU techniques [9]. Model B: is the representation of Umair et al, [10], that classically execute different machine learning based classification models such as support vector machine and naïve Bayesian algorithm on Roman Urdu text to test its accuracy.…”
Section: Model A: Khan Et Al 2021 Which Is Abbreviated Asmentioning
confidence: 99%
See 1 more Smart Citation
“…Model A, is a rule based machine learning model for Urdu sentiment analysis using support vector machine, Naïve Bayesian, Adabbost, MLP, LR and RF along with deep learning model using CNN-1D, LSTM, Bi-LSTM, GRU and Bi-GRU techniques [9]. Model B: is the representation of Umair et al, [10], that classically execute different machine learning based classification models such as support vector machine and naïve Bayesian algorithm on Roman Urdu text to test its accuracy.…”
Section: Model A: Khan Et Al 2021 Which Is Abbreviated Asmentioning
confidence: 99%
“…Further, languages such as French, English, Spanish, and other European languages must be addressed in terms of tool accessibility. Despite this, languages like Punjabi, Urdu and Hindi are seen as lacking in [5][6][7][8][9][10].…”
Section: Introductionmentioning
confidence: 99%
“…Khan et al [19] perform SA on Urdu language using a dataset comprising multiple domains including beverages, movies sports, politics, etc. Rule-based, ML, and DL approaches are used for the classification of the text.…”
Section: Deep Learning Approachmentioning
confidence: 99%
“…Users of SN connect, share their thoughts, feelings, and ideas, and participate in discussion groups. Text conversation, or more specifically, emotion classification (EC), is essential to comprehending people's activities since the internet's invisible nature has made it possible for a single user to engage in violent SN speech data [20].…”
Section: Introductionmentioning
confidence: 99%