2020
DOI: 10.1016/j.jksuci.2018.05.010
|View full text |Cite
|
Sign up to set email alerts
|

Feature selection using an improved Chi-square for Arabic text classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
88
0
3

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 179 publications
(127 citation statements)
references
References 14 publications
0
88
0
3
Order By: Relevance
“…In their study, Yelmen et all., established a two-step FS strategy that used IG and genetic search (GS) to obtain the optimum feature subset for sentiment classification [25]. Bahassine et al used an improved FS strategy for Arabic text analysis was and developed an improved version of the CHI filter approach to classify a document of six classes [26].…”
Section: Feature Selection In Text Categorizationmentioning
confidence: 99%
“…In their study, Yelmen et all., established a two-step FS strategy that used IG and genetic search (GS) to obtain the optimum feature subset for sentiment classification [25]. Bahassine et al used an improved FS strategy for Arabic text analysis was and developed an improved version of the CHI filter approach to classify a document of six classes [26].…”
Section: Feature Selection In Text Categorizationmentioning
confidence: 99%
“…p(t) and p(t -) are the probabilities of presence and absence of term t respectively. P(ci|t) and P(ci|t -) are the conditional probabilities of class ci considering presence and absence of t respectively [21], [7], [49], and [4]. IG is used to reduce the entropy caused by partitioning the objects according to an attribute.…”
Section: Performance%mentioning
confidence: 99%
“…There are several algorithms that can be used to classify documents. Examples of such algorithms include; but not limited to; K-nearest neighbor (KNN), support vector machine (SVM), logistic regression (LR), random forest (RF), Naïve Bayes (NB), decision tree (DT), artificial neural network (ANN), and others [2], [4][5][6][7][8][9][10][11][12][13][14][15]. One of the main problems of classifying documents is the huge number of features which are describing a dataset.…”
Section: Introduction and Related Workmentioning
confidence: 99%
“…Bước 2: Lựa chọn đặc trưng: Chúng tôi sử dụng phương pháp Chi Square (CHI) (Bahassine, Madani, Al-Sarem, & Kissi, 2018;Thabtah, 2018) để đánh giá giá độ liên quan của các đặc trưng tới kết quả phân lớp.…”
Section: Xây Dựng Mô Hình Dự đOánunclassified