2021
DOI: 10.33395/sinkron.v6i1.11179
|View full text |Cite
|
Sign up to set email alerts
|

TF-IDF Method and Vector Space Model Regarding the Covid-19 Vaccine on Online News

Abstract: Advances in information and technology have caused the use of the internet to be a concern of the general public. Online news sites are one of the technologies that have developed as a means of disseminating the latest information in the world. When viewed in terms of numbers, newsreaders are very sufficient to get the desired information. However, with this, the amount of information collected will result in an explosion of information and the possibility of information redundancy. The search system is one of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
1
0
2

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 6 publications
0
1
0
2
Order By: Relevance
“…Relevansi dokumen terhadap sebuah query diukur berdasarkan kesamaan antara vektor dokumen dan vektor query. Pembobotan TF-IDF dan VSM digunakan untuk mewakili nilai numerik dokumen sehingga memungkinkan perhitungan kedekatan antara dokumen-dokumen (Zen et al, 2021). Semakin dekat dua vektor dalam VSM, maka semakin mirip dua dokumen yang diwakili oleh vektor tersebut.…”
Section: Visualisasi Data Menggunakan Vector Spaceunclassified
“…Relevansi dokumen terhadap sebuah query diukur berdasarkan kesamaan antara vektor dokumen dan vektor query. Pembobotan TF-IDF dan VSM digunakan untuk mewakili nilai numerik dokumen sehingga memungkinkan perhitungan kedekatan antara dokumen-dokumen (Zen et al, 2021). Semakin dekat dua vektor dalam VSM, maka semakin mirip dua dokumen yang diwakili oleh vektor tersebut.…”
Section: Visualisasi Data Menggunakan Vector Spaceunclassified
“…Sehingga data yang dibuat dapat dimasukan sebagai dataset dalam bentuk list [20]. Token kemudian dimasukan kedalam sebuah variabel dan diubah menjadi sebuah Tensor berisikan angka yang diproses melalui sebuah algortime [21]. Perhitungan bobot kata dimulai dengan menghitung nilai TF dengan bobot masing masing kata adalah 1, dan IDF dihitung dengan cara 𝑇𝐹 βˆ’ 𝐼𝐷𝐹(π‘‘π‘˜, 𝑑𝑗) = 𝑇𝐹(π‘‘π‘˜, 𝑑𝑗) * 𝐼𝐷𝐹(π‘‘π‘˜, 𝑑𝑗)…”
Section: G Tf-idf Tokenizingunclassified
“…The Bag of Words model, which employs Term Frequency -Inverse Document Frequency (TF-IDF), is a feature extraction technique that measures the significance of a word in a document by considering its relationship with other words in the document and assigning a weight to each word [19], [20]. In text mining, TF-IDF is a weighting factor that reflects the importance of a word [21]. The value of TF-IDF increases as the word frequency in the document increases, but it is reduced by the frequency of words in the entire corpus.…”
Section: Introductionmentioning
confidence: 99%