2022
DOI: 10.11591/ijai.v11.i3.pp1041-1048
|View full text |Cite
|
Sign up to set email alerts
|

The feature extraction for classifying words on social media with the Naïve Bayes algorithm

Abstract: To classify Naïve Bayes classification (NBC), however, it is necessary to have a previous pre-processing and feature extraction. Generally, pre-processing eliminates unnecessary words while feature extraction processes these words. This paper focuses on feature extraction in which calculations and searches are used by applying word2vec while in frequency using term frequency-Inverse document frequency (TF-IDF). The process of classifying words on Twitter with 1734 tweets which are defined as a document to weig… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 23 publications
0
1
0
1
Order By: Relevance
“…Under the condition that the position  and the space  are certain, the gray value can be simplified to ( , ) P i j and the parameter ( , ) ij takes a range of values related to the number of gray levels L , which is a natural number less than or equal to 1 L  . The grayscale co-occurrence matrix uses the grayscale correlation of each pixel in the space to predict the probability of a certain grayscale value, and ultimately to achieve the description of texture features [19]. However, the computational pressure of the model would be too high if the probability of a gray value is calculated for all locations in the space.…”
Section: Extraction Of Texture Features In Video Imagesmentioning
confidence: 99%
“…Under the condition that the position  and the space  are certain, the gray value can be simplified to ( , ) P i j and the parameter ( , ) ij takes a range of values related to the number of gray levels L , which is a natural number less than or equal to 1 L  . The grayscale co-occurrence matrix uses the grayscale correlation of each pixel in the space to predict the probability of a certain grayscale value, and ultimately to achieve the description of texture features [19]. However, the computational pressure of the model would be too high if the probability of a gray value is calculated for all locations in the space.…”
Section: Extraction Of Texture Features In Video Imagesmentioning
confidence: 99%
“…Bayesian optimization is an approach technique for searching for the optimum value of a function by using the probabilistic of the overall search and evaluating the function [24,25], Bayesian will use the theory of Bayesian probability for an iterative model so that it can have the advantage of updating initial knowledge [26,27]. This research can help in improving the model that ignores text or information that has important value in producing a summary.…”
Section: Introductionmentioning
confidence: 99%
“…[8][9]. Text Clustering adalah salah satu metode yang bertujuan untuk meng-Volume 4, Nomor 1, Desember 2022, Page 40-47 ISSN ISSN 2808-005X Available Online at http://ejournal.sisfokomtek.org/index.php/jumin Rizky Dea Mustika, Copyright © 2022, JUMIN, Page 41 Submitted: 15/11/2022; Accepted: 25/11/2022; Published: 15/12/2022 cluster data yang berupa dokumen text mejadi lebih terstuktur.…”
unclassified