2010
DOI: 10.1007/978-3-642-12116-6_57
|View full text |Cite
|
Sign up to set email alerts
|

An Empirical Study on the Feature’s Type Effect on the Automatic Classification of Arabic Documents

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2015
2015
2018
2018

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(8 citation statements)
references
References 10 publications
0
8
0
Order By: Relevance
“…TC algorithms require that text features are formatted before they can be interpreted by the specified classifier, this process is also referred to as term weighting because each term is entered together with a weight value. Included papers show the most used technique is the Term Frequency-Inverse Document Frequency (TF-IDF) as in [27,32,37,40,43,45,48,51,53,55,57,58,[60][61][62]67]. It is a statistical method to indicate the significance of a word within a given corpus.…”
Section: E Feature Reresentation (Term Weighting)mentioning
confidence: 99%
“…TC algorithms require that text features are formatted before they can be interpreted by the specified classifier, this process is also referred to as term weighting because each term is entered together with a weight value. Included papers show the most used technique is the Term Frequency-Inverse Document Frequency (TF-IDF) as in [27,32,37,40,43,45,48,51,53,55,57,58,[60][61][62]67]. It is a statistical method to indicate the significance of a word within a given corpus.…”
Section: E Feature Reresentation (Term Weighting)mentioning
confidence: 99%
“…Raheel et al [6] combined the Boosting method and the decision tree as a hybrid classifier. They used lemmatisation as a method of extracting the characteristics, and the TFIDF for the weighting.…”
Section: Related Workmentioning
confidence: 99%
“…Raheel et al [3] have shown in a comparative study the influence of the choice of entities representing a document, on manipulating the performance of classifiers. They selected as descriptors, words in their original form, lemmas, roots, and the ngrams.…”
Section: Related Workmentioning
confidence: 99%
“…Document indexing involves extracting keywords that best represent a document. In spite of the essential role of this phase in the next step of the natural language processing process, few are the works identified at this level [1][2] [3].…”
Section: Introductionmentioning
confidence: 99%