2012 Colloquium in Information Science and Technology 2012
DOI: 10.1109/cist.2012.6388065
|View full text |Cite
|
Sign up to set email alerts
|

Stemming versus Light Stemming for measuring the simitilarity between Arabic Words with Latent Semantic Analysis model

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(8 citation statements)
references
References 5 publications
0
8
0
Order By: Relevance
“…So far, the research studies presented by Froud et al (2010) and Froud, Lachkar & Ouatik (2012b) are the only works that have investigated the effect of using stemming on semantic similarity of Arabic text. Froud et al (2010) investigated diverse similarity measures with document clustering and they applied stemming to words which have reduced documents representation and provided fast clustering.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…So far, the research studies presented by Froud et al (2010) and Froud, Lachkar & Ouatik (2012b) are the only works that have investigated the effect of using stemming on semantic similarity of Arabic text. Froud et al (2010) investigated diverse similarity measures with document clustering and they applied stemming to words which have reduced documents representation and provided fast clustering.…”
Section: Related Workmentioning
confidence: 99%
“… Froud et al (2010) investigated diverse similarity measures with document clustering and they applied stemming to words which have reduced documents representation and provided fast clustering. Froud, Lachkar & Ouatik (2012b) tested the effect of using stemming and light stemming on the semantic similarity between Arabic words. The similarity is measured by Latent Semantic Analysis (LSA) and computed by using different measures.…”
Section: Related Workmentioning
confidence: 99%
“…We choose to discuss [3] algorithm since it's one of the most successful approaches to Arabic Stemming [6], the Light10 algorithm which is the modification version of Light8 and developed by [3], has outperformed most of the morphological analyzers and tries to improve the information retrieval performance [13,10,12,17]. [3] constructed the stemmers "suffix 1 and prefix 2 " based on heuristics, light10 Stemming Algorithm Steps are:  Remove ‫"و"‬ for V1, light 2, light 3, light 8, and light10 if the remainder of the word is 3 or more characters long.…”
Section: Stemmer Based On Suffix and Prefixmentioning
confidence: 99%
“…Classification [21], [22], [23], [24], [25], [26], [27], [28], [29], [30], [31], [32], [33] , [34], [35], [36], [37], [38], [39], [40], [41], [42], [43], [44], [45], [46], [47], [18], [48], [49], [50] 31(29%) Stemming and Lemmatization [51], [52], [53], [54], [55], [4], [56], [57], [58], [59], [60], [61] 12(11%) Information Retrieval [9], [62], [63], [64], [65], [66], [67], …”
Section: Techniquementioning
confidence: 99%