2013 UKSim 15th International Conference on Computer Modelling and Simulation 2013
DOI: 10.1109/uksim.2013.135
|View full text |Cite
|
Sign up to set email alerts
|

Arabic Text Classification Based on Features Reduction Using Artificial Neural Networks

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 22 publications
(10 citation statements)
references
References 9 publications
0
10
0
Order By: Relevance
“…TC algorithms require that text features are formatted before they can be interpreted by the specified classifier, this process is also referred to as term weighting because each term is entered together with a weight value. Included papers show the most used technique is the Term Frequency-Inverse Document Frequency (TF-IDF) as in [27,32,37,40,43,45,48,51,53,55,57,58,[60][61][62]67]. It is a statistical method to indicate the significance of a word within a given corpus.…”
Section: E Feature Reresentation (Term Weighting)mentioning
confidence: 99%
“…TC algorithms require that text features are formatted before they can be interpreted by the specified classifier, this process is also referred to as term weighting because each term is entered together with a weight value. Included papers show the most used technique is the Term Frequency-Inverse Document Frequency (TF-IDF) as in [27,32,37,40,43,45,48,51,53,55,57,58,[60][61][62]67]. It is a statistical method to indicate the significance of a word within a given corpus.…”
Section: E Feature Reresentation (Term Weighting)mentioning
confidence: 99%
“…-identifying the term significance is very useful in TC system, that can be done by weighting term which can be calculated by multiplying TF by the IDF [2] Which can be calculated as in Equation 1.…”
Section: -Tf-idfmentioning
confidence: 99%
“…It has been applied effectively on various occasions and is incorporated in our regular day to day existences. For example, daily paper articles and scholastic papers are frequently composed by subject or field [2]. TC gives a solution in consequently arranging innumerable articles and papers.…”
Section: Introductionmentioning
confidence: 99%
“…Third, dimension reduction, in this step, the most useful and valuable features for classification are selected. Most commonly selection methods are Chi square and document frequency, information gain and mutual information [14,15,16,17] Usually text documents are represented as a vector of term weights (word features). One major problem in classification for Arabic and English language is the high dimensionality of text documents.…”
Section: Arabic Languagementioning
confidence: 99%