2019
DOI: 10.11591/ijeecs.v13.i1.pp420-426

Flexibility of Indonesian text pre-processing library

Abstract: This study aimed to achieve and measure flexibility as a software quality factor of text pre-processing libraries for Indonesian text from social media. The library was built based on a review of several text mining applications that did not yet have dedicated pre-processing for Indonesian text. The text pre-processing libraries were designed and built using a modular, object-oriented approach to achieve flexibility. Flexibility was measured by the McCall Cyclomatic Complexity (CC) metric…
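
The abstract is truncated before it explains how the CC metric was applied, so the following is only a minimal sketch of the general idea, not the authors' measurement procedure. It assumes the common simplification that cyclomatic complexity equals one plus the number of decision points in the code. The function name `cyclomatic_complexity`, the set of counted node types, and the `SAMPLE` snippet are all hypothetical choices made for illustration.

```python
import ast

# Node types that each add one decision point.
# This is an assumed, simplified rule set, not a complete one.
_BRANCH_NODES = (ast.If, ast.For, ast.While, ast.ExceptHandler, ast.IfExp)


def cyclomatic_complexity(source: str) -> int:
    """Approximate cyclomatic complexity as 1 + number of decision points."""
    tree = ast.parse(source)
    complexity = 1
    for node in ast.walk(tree):
        if isinstance(node, _BRANCH_NODES):
            complexity += 1
        elif isinstance(node, ast.BoolOp):
            # "a and b and c" contributes two additional branches.
            complexity += len(node.values) - 1
    return complexity


# Hypothetical pre-processing helper used only as input for the metric.
SAMPLE = '''
def clean(tokens, stopwords):
    kept = []
    for tok in tokens:
        if tok.isalpha() and tok not in stopwords:
            kept.append(tok)
    return kept
'''

if __name__ == "__main__":
    print(cyclomatic_complexity(SAMPLE))
```

Under this counting rule, small single-purpose functions score low, which is the usual reason a modular design is expected to be easier to change.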

Cited by 14 publications (4 citation statements)
References 22 publications
“…Text pre-processing is an important phase in text mining that prepares text data before the mining process [41,42]; it includes tokenizing, case folding, cleaning text data, stopword removal, and stemming. Tokenizing and case folding prepare text data so that it can easily be converted into a structured representation with specific and uniform terms.…”
Section: Text Pre-processing
Citation type: mentioning
confidence: 99%
“…In the implementation phase, several text pre-processing steps are conducted to build the chatbot before running the MNB and RAKE algorithms. Text data is unstructured data that must be transformed into structured data, so text pre-processing is an important phase for getting good results [37], [38]. Text pre-processing means preparing initial text data, which is still highly varied, so that it becomes regular data to which existing text mining methods can be applied [39].…”
Section: Fig 2 Work Flow Of Chatbot In Question Answering System
Citation type: mentioning
confidence: 99%
“…Text pre-processing is an important phase in natural language processing [85], [86]. Generally, text pre-processing prepares an Indonesian document corpus through several processes, including tokenizing each paragraph and each sentence, lowercasing, removing non-letter characters and regular-expression patterns, removing Indonesian stopwords (words that carry little information), and stemming using the Porter algorithm for the Indonesian language (where stemming reduces affixed words to their base forms).…”
Section: Structured Representation For Text
Citation type: mentioning
confidence: 99%
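
The steps named in the citation statements (case folding, cleaning, tokenizing, stopword removal, stemming) can be sketched as a small pipeline. This is an illustrative sketch under stated assumptions, not the library described in the paper: every function name is hypothetical, the stopword list is a tiny sample, and `toy_stem` only strips a few suffixes, so it is not the Porter algorithm for Indonesian. The sketch also lowercases and cleans before tokenizing, which is one possible ordering.

```python
import re

# Tiny illustrative stopword list (assumption; real lists are much longer).
STOPWORDS = {"yang", "dan", "di", "ke", "dari", "ini", "itu"}

# Toy suffix list for illustration only; NOT a real Indonesian stemmer.
SUFFIXES = ("nya", "kan", "lah")


def case_fold(text: str) -> str:
    """Lowercase the text so that terms are uniform."""
    return text.lower()


def clean(text: str) -> str:
    """Replace every character that is not a-z or whitespace with a space."""
    return re.sub(r"[^a-z\s]", " ", text)


def tokenize(text: str) -> list[str]:
    """Split on runs of whitespace."""
    return text.split()


def remove_stopwords(tokens: list[str]) -> list[str]:
    """Drop tokens that appear in the stopword list."""
    return [tok for tok in tokens if tok not in STOPWORDS]


def toy_stem(token: str) -> str:
    """Strip one known suffix if at least three letters remain."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text: str) -> list[str]:
    """Run the full pipeline and return the processed tokens."""
    tokens = tokenize(clean(case_fold(text)))
    return [toy_stem(tok) for tok in remove_stopwords(tokens)]


if __name__ == "__main__":
    print(preprocess("Bukunya ada di meja, dan itu BAGUS!!"))
    # → ['buku', 'ada', 'meja', 'bagus']
```

Keeping each step as a separate function means a step can be swapped, for example replacing `toy_stem` with a proper Indonesian stemmer, without touching the rest of the pipeline.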