2018 Fourth International Conference on Information Retrieval and Knowledge Management (CAMP) 2018
DOI: 10.1109/infrkm.2018.8464767
|View full text |Cite
|
Sign up to set email alerts
|

Analyzing Malay Stemmer Performance Towards Fuzzy Logic Ranking Function on Malay Text Corpus

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 4 publications
0
4
0
Order By: Relevance
“…For example, term "makanan" will be stemmed to its root word "makan". Dictionary for Malay root word and morphological rules for Malay language are applied in the stemming process [16]. Lastly, the stemmed word are used as keywords to search from the indexed file before the retrieved documents are ranked and displayed to the user [16,26,27].…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, term "makanan" will be stemmed to its root word "makan". Dictionary for Malay root word and morphological rules for Malay language are applied in the stemming process [16]. Lastly, the stemmed word are used as keywords to search from the indexed file before the retrieved documents are ranked and displayed to the user [16,26,27].…”
Section: Methodsmentioning
confidence: 99%
“…Hadith Sahih Al Bukhari Malay from Ar-Rahman Labs features Complete Sahih Bukhari book in Malay but this application only supports keyword search [14]. Mutiara Hadis is the pioneer of the Malay Hadith Information Retrieval [15][16][17][18][19][20]. It is one of the web-based application search engines for hadith translated in Malay language.…”
mentioning
confidence: 99%
“…The aim is to rid the corpus of white space, missing values, duplicate reviews, stop words, non-ASCII characters, and typos that could negatively affect the result of analytics [2]. Stemming [10] reduces words into their base form, where they are considered as one single feature, for example, "walking," "walked" and "walks" are stemmed to "walk." Language detection (LD) in preprocessing helps to reduce the extracted corpus size by filtering out unrelated text based on the language used [11], [12].…”
Section: Introductionmentioning
confidence: 99%
“…Removals [4] typically involve white spaces, missing values, duplicate reviews, stop words, non-ascii characters and typos which could have an adverse influence on the result. Stemming [19] replaces words with their canonical form for example stand in place of standing, stood and stands. Language detection (LD) [6,7,24] is crucial for language-dependent tokenisers and could considerably decrease the size of data extracted.…”
Section: Introductionmentioning
confidence: 99%