2014
DOI: 10.1111/ijal.12071

A simple algorithm for the problem of suffix stripping

Abstract: Suffix stripping is the problem of removing morphological suffixes from a word to obtain its stem. We present suffix stripping as an unconstrained optimization problem. A simple algorithm is developed that requires no linguistic or morphological knowledge. Its superiority over an established technique for the English language is demonstrated.
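To make the task concrete, here is a minimal sketch of suffix stripping as a plain longest-match rule. It is not the algorithm of the paper, whose details the abstract does not give, and it relies on a hand-written suffix list, which is exactly the kind of linguistic knowledge the paper says it avoids. The suffix list, the `strip_suffix` name, and the `min_stem` threshold are all assumptions made for illustration.

```python
# Hypothetical suffix list, chosen for illustration only.
SUFFIXES = ("ational", "ization", "ness", "ing", "ed", "ly", "es", "s")


def strip_suffix(word, suffixes=SUFFIXES, min_stem=3):
    """Remove the longest listed suffix that leaves at least min_stem letters.

    Returns the word unchanged when no listed suffix qualifies.
    """
    # Try longer suffixes first so "ness" wins over "s" for "kindness".
    for suffix in sorted(suffixes, key=len, reverse=True):
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem:
            return word[: -len(suffix)]
    return word


if __name__ == "__main__":
    for w in ("jumping", "kindness", "walked", "cats", "sing"):
        print(w, "->", strip_suffix(w))
```

The `min_stem` guard keeps short words such as "sing" intact; without it, the rule would strip "ing" and leave a single letter.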

Cited by 4 publications (4 citation statements)
References 22 publications
“…To compare our approach with some standard algorithms from the literature, we also applied the following three stemming treatments over the sample being studied: Porter's algorithm [7], Lancaster algorithm [8] and Approx-stem [26]. Columns 6, 8 and 10 of Table 3 present the outcomes of the above algorithms and columns 7, 9 and 11 record the lengths of corresponding stems, respectively. To implement Lancaster algorithm [8], we employed a standard package: NLTK 3.4.5 in Python.…”
Section: Results
Citation type: mentioning
confidence: 99%
“…[33] Nonparametric tests like Wilcoxon test have been used by various authors in the literature to evaluate the performance of their algorithms [19][20][21][26]. We choose to apply Kruskal-Wallis test for multiple samples. This test is the nonparametric alternative to the one-way ANOVA.…”
Section: Results
Citation type: mentioning
confidence: 99%
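
The quoted passage compares several samples with the Kruskal-Wallis test. As a rough illustration of what that test computes, the sketch below pools the samples, ranks them (tied values share their mean rank), and evaluates the H statistic with the standard tie correction. The function names and the example data are hypothetical and are not taken from the citing paper; a p-value would additionally require the chi-square distribution with k - 1 degrees of freedom, which is left out here.

```python
from itertools import chain


def average_ranks(values):
    """Return 1-based ranks, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # sorted positions i..j are 0-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def kruskal_wallis_h(*groups):
    """Tie-corrected Kruskal-Wallis H statistic for two or more samples.

    Assumes every group is non-empty and the pooled values are not all equal.
    """
    pooled = list(chain.from_iterable(groups))
    n_total = len(pooled)
    ranks = average_ranks(pooled)

    # Sum of (rank sum of group)^2 / (group size), walking the pooled ranks.
    weighted = 0.0
    start = 0
    for group in groups:
        rank_sum = sum(ranks[start:start + len(group)])
        weighted += rank_sum ** 2 / len(group)
        start += len(group)
    h = 12.0 / (n_total * (n_total + 1)) * weighted - 3 * (n_total + 1)

    # Tie correction: 1 - sum(t^3 - t) / (N^3 - N) over groups of tied values.
    counts = {}
    for value in pooled:
        counts[value] = counts.get(value, 0) + 1
    tie_term = sum(t ** 3 - t for t in counts.values())
    return h / (1 - tie_term / (n_total ** 3 - n_total))


if __name__ == "__main__":
    # Hypothetical stem lengths produced by three stemmers on the same words.
    print(kruskal_wallis_h([1, 2, 3], [4, 5, 6], [7, 8, 9]))
```

A larger H indicates that the rank sums of the groups differ more than would be expected if all samples came from the same distribution.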
“…This process of breaking user-supplied answers into distinct words and comparing the standard keywords to the user-supplied answer is the most crucial part of the subjective examination process. Along with the standard keywords, the synonyms of the keywords submitted by the user will also be extracted and tokenized using Porter's algorithm [11]. The tokenized keywords are the words that provide the key concepts of significance in a document.…”
Section: Answer Processing
Citation type: mentioning
confidence: 99%