2014
DOI: 10.3844/jcssp.2014.2232.2239
|View full text |Cite
|
Sign up to set email alerts
|

A Comparative Study of Combined Feature Selection Methods for Arabic Text Classification

Abstract: Text classification is a very important task due to the huge amount of electronic documents. One of the problems of text classification is the high dimensionality of feature space. Researchers proposed many algorithms to select related features from text. These algorithms have been studied extensively for English text, while studies for Arabic are still limited. This study introduces an investigation on the performance of five widely used feature selection methods namely Chi-square, Correlation, GSS Coefficien… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 19 publications
(13 citation statements)
references
References 21 publications
(21 reference statements)
0
13
0
Order By: Relevance
“…The S features describing the dataset can be considered as S vectors. The similarity measure Sim(fi,fj) between any two features fi and fj can be computed as shown in equation (16). (16) As a result, the matrix in Figure represents the similarity values among the individual features.…”
Section: Proposal Of a Semantic Fusion Methods (Sf-mw) For Featuresmentioning
confidence: 99%
See 1 more Smart Citation
“…The S features describing the dataset can be considered as S vectors. The similarity measure Sim(fi,fj) between any two features fi and fj can be computed as shown in equation (16). (16) As a result, the matrix in Figure represents the similarity values among the individual features.…”
Section: Proposal Of a Semantic Fusion Methods (Sf-mw) For Featuresmentioning
confidence: 99%
“…A huge number of features; in most cases; may reduce the efficiency of the adopted classifiers and also consume more time. So, the feature selection process is very important to choose a subset of high significant features and eliminate the non-significant ones [16]. Moreover, several research works were presented regarding text classification, machine learning algorithms, and feature selection methods.…”
Section: Introduction and Related Workmentioning
confidence: 99%
“…In [20], the authors studied the effect of five well-known feature selection methods: Correlation, X 2 , Information Gain [15], GSS Coefficient [21], and Relief-F [22]. Moreover, this paper proposed an approach of combination of feature selection methods based on the average weight of the features.…”
Section: Related Workmentioning
confidence: 99%
“…There are several techniques of machine learning in sentiment analysis they are lexicon machine learning and rule-based method. [12] Ml methods use different machine learning algorithms and labeled data to train a classifier to find sentiment. In [13] this paper deep convolution with text normalization and character level embedding, for both unstructured and structured data is conducted to perform sentiment analysis to handle the less memory space.…”
Section: II Literature Reviewmentioning
confidence: 99%