An Empirical Study on the Feature’s Type Effect on the Automatic Classification of Arabic Documents

Raheel, Saeed; Dichy, Joseph

doi:10.1007/978-3-642-12116-6_57

Cited by 9 publications

(8 citation statements)

References 10 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…TC algorithms require that text features are formatted before they can be interpreted by the specified classifier, this process is also referred to as term weighting because each term is entered together with a weight value. Included papers show the most used technique is the Term Frequency-Inverse Document Frequency (TF-IDF) as in [27,32,37,40,43,45,48,51,53,55,57,58,[60][61][62]67]. It is a statistical method to indicate the significance of a word within a given corpus.…”

Section: E Feature Reresentation (Term Weighting)mentioning

confidence: 99%

Arabic text classification methods: Systematic literature review of primary studies

Alabbas

Al-Khateeb

Mansour

2016

2016 4th IEEE International Colloquium on Information Science and Technology (CiSt)

View full text Add to dashboard Cite

Section: E Feature Reresentation (Term Weighting)mentioning

confidence: 99%

Arabic text classification methods: Systematic literature review of primary studies

Alabbas

Al-Khateeb

Mansour

2016

2016 4th IEEE International Colloquium on Information Science and Technology (CiSt)

View full text Add to dashboard Cite

“…Raheel et al [6] combined the Boosting method and the decision tree as a hybrid classifier. They used lemmatisation as a method of extracting the characteristics, and the TFIDF for the weighting.…”

Section: Related Workmentioning

confidence: 99%

Toward a Complex System for Context Discovery to Index Arabic Documents

Bazzi¹

2018

JCP

View full text Add to dashboard Cite

Text indexing aims to take the full advantage of textual data to help intelligent programs to make relevant decisions. In order to explore a large amount of textual documents, and to disclose semantic information hidden in unstructured documents, like texts, an effective indexation system is required. In this paper, we propose a new approach for indexing Arabic texts. Based on the semantic proximity and taking into account the contexts contained in each document, our method is denoted contextual indexing. Several algorithms are used for keywords extraction, each of them emphasizes some criterion. However, we target the most descriptive keywords for each document. We also propose a new approach for document modeling. We compared the results obtained using our method with those obtained by an indexation system based on a standard statistical method. The experimental results demonstrate the performance of our approach.

show abstract

“…Raheel et al [3] have shown in a comparative study the influence of the choice of entities representing a document, on manipulating the performance of classifiers. They selected as descriptors, words in their original form, lemmas, roots, and the ngrams.…”

Section: Related Workmentioning

confidence: 99%

“…Document indexing involves extracting keywords that best represent a document. In spite of the essential role of this phase in the next step of the natural language processing process, few are the works identified at this level [1][2] [3].…”

Section: Introductionmentioning

confidence: 99%

Features based approach for indexation and representation of unstructured Arabic documents

Bazzi¹,

Mammass²,

Ennaji³

et al. 2017

Adv. sci. technol. eng. syst. j.

View full text Add to dashboard Cite

show abstract

An Empirical Study on the Feature’s Type Effect on the Automatic Classification of Arabic Documents

Cited by 9 publications

References 10 publications

Arabic text classification methods: Systematic literature review of primary studies

Arabic text classification methods: Systematic literature review of primary studies

Toward a Complex System for Context Discovery to Index Arabic Documents

Features based approach for indexation and representation of unstructured Arabic documents

Contact Info

Product

Resources

About