The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2017
DOI: 10.1186/s40537-017-0083-6
|View full text |Cite
|
Sign up to set email alerts
|

Improved sqrt-cosine similarity measurement

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
42
0
1

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 72 publications
(43 citation statements)
references
References 35 publications
0
42
0
1
Order By: Relevance
“…Cosine Similarity merupakan metode yang digunakan untuk mengidentifikasi terjadinya kasus cyberbullying [21] seperti Rumus 4. Meningkatkan efektifitas untuk deteksi menggunakan pengembangan dari metode Cosine Similarity yaitu menggunakan metode Improved Sqrt-Cosine (ISC) menggunakan Rumus 4 [22].…”
Section: Improved Sqrt-cosine (Isc)unclassified
“…Cosine Similarity merupakan metode yang digunakan untuk mengidentifikasi terjadinya kasus cyberbullying [21] seperti Rumus 4. Meningkatkan efektifitas untuk deteksi menggunakan pengembangan dari metode Cosine Similarity yaitu menggunakan metode Improved Sqrt-Cosine (ISC) menggunakan Rumus 4 [22].…”
Section: Improved Sqrt-cosine (Isc)unclassified
“…It tends to be small for training samples belonging to the same feature set. In our case, since we're dealing with POI embedding vectors, we calculate the edge weight between a pair of POIs by measuring the similarity between embedding vectors of two POIs using improved sqrt-cosine similarity [26]:…”
Section: Mapping Poi Embeddings Based On Feature Importancementioning
confidence: 99%
“…Term frequency (tf ) [3], inverse document frequency (idf ) [12], or multiplication of tf and idf (tf-idf ) [13][14][15] are commonly used term weighting schemes. In large-scale text document collections, using VSM results sparse vectors, i.e., most of the term weights in a document vector are zero [16,17]. High dimensionality can be a problem for computing the similarity between two documents.…”
Section: Page 2 Of 23mentioning
confidence: 99%
“…For two 0-1 vectors, 2 the Hamming distance [17] is the number of positions at which the stored term weights are different. The Chebyshev distance [16] between two vectors is the greatest of absolute differences along any dimension. A similarity measure for text processing (SMTP) [17] is used for comparing two text documents.…”
Section: Page 2 Of 23mentioning
confidence: 99%
See 1 more Smart Citation