2018 6th International Conference on Cyber and IT Service Management (CITSM) 2018
DOI: 10.1109/citsm.2018.8674054

An Improved of Stemming Algorithm for Mining Indonesian Text with Slang on Social Media

Citations: cited by 27 publications (14 citation statements)
References: 13 publications
“…The pre-processing text data begins with preparing data collection from the real data into clean data text after data cleaning, stopwords removal, and stemming process. For stopwords, it used the modified collection of Indonesian stopwords and Porter Stemming so that the structure and grammar are suitable with the Indonesian language [26,43]. Table 1 shows the example of the original Indonesian News data that becomes the data after text pre-processing.…”
Section: Results Of Text Pre-processing (mentioning)
confidence: 99%
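The pipeline described in this statement (data cleaning, stopword removal, then stemming) can be sketched roughly as follows. This is only an illustrative sketch, not the citing authors' code: the `STOPWORDS` sample, the function names, and the pass-through default for `stem` are all assumptions made for the example.

```python
import re

# Hypothetical, tiny stopword sample; the cited work uses a larger,
# modified collection of Indonesian stopwords that is not reproduced here.
STOPWORDS = {"yang", "dan", "di", "ke", "dari", "untuk"}


def clean(text):
    """Lowercase the text and replace everything except a-z with spaces."""
    text = text.lower()
    text = re.sub(r"[^a-z\s]", " ", text)
    return re.sub(r"\s+", " ", text).strip()


def remove_stopwords(tokens):
    """Drop tokens that appear in the stopword set."""
    return [t for t in tokens if t not in STOPWORDS]


def preprocess(text, stem=lambda word: word):
    """Clean, tokenize, remove stopwords, then stem each remaining token.

    `stem` defaults to a pass-through; any stemming function can be plugged in.
    """
    tokens = clean(text).split()
    return [stem(t) for t in remove_stopwords(tokens)]


print(preprocess("Presiden dan menteri pergi ke Jakarta!"))
```

Passing the stemmer in as a parameter keeps the cleaning and stopword steps independent of whichever Indonesian stemming algorithm is chosen.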
“…Even though, in several text mining studies, the stemming process does not have a large effect on accuracy [45]. The stemming process depends on the language; among the many Indonesian stemming algorithms [46][47][48][49], this research uses an improved Porter algorithm modified for the Indonesian language [50]. An example result of text pre-processing is given in Table 2, which shows the pre-processing result for the example hadith text from Table 1, provided in Indonesian.…”
Section: Text Pre-processing (mentioning)
confidence: 99%
“…Therefore, this study aims to handle overstemming by modifying the stemming process. The modification combines a look-up table algorithm, which contains a table of word rules, with the well-known Porter stemming algorithm [8], [9]. The word-rule table is stored in a database, which makes it easier to update the types of words that suffer from overstemming.…”
Section: P-ISSN: 2621-8070 E-ISSN: 2686-3219 (unclassified)
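The idea in this statement, consulting a look-up table before falling back to rule-based stemming, can be illustrated with a minimal sketch. It rests on several assumptions: the `LOOKUP` entries, the `SUFFIXES` subset, and both function names are hypothetical, the table is an in-memory dictionary rather than a database, and `strip_suffix` is a deliberately simplified stand-in, not the Porter algorithm itself.

```python
# Hypothetical exception table mapping a word to its correct root.
# The citing work stores such a table in a database; a dict is used here.
LOOKUP = {"makan": "makan", "bulan": "bulan"}

# Simplified subset of Indonesian suffixes, for illustration only.
SUFFIXES = ("kan", "an", "i")


def strip_suffix(word):
    """Naive suffix stripping (a stand-in for a Porter-style stemmer)."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 2:
            return word[: -len(suffix)]
    return word


def stem(word, lookup=LOOKUP):
    """Return the look-up table entry if present, else strip a suffix."""
    word = word.lower()
    if word in lookup:
        return lookup[word]
    return strip_suffix(word)


print(strip_suffix("makan"))  # overstemmed by the naive rules
print(stem("makan"))          # protected by the look-up table
print(stem("minuman"))        # handled by the suffix rules
```

Because the exceptions live in data rather than in the rules, a newly discovered overstemmed word can be handled by adding one table entry, which matches the update convenience the statement describes.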