2022
DOI: 10.17977/um018v4i22021p105-116
|View full text |Cite
|
Sign up to set email alerts
|

Similarity Identification of Large-scale Biomedical Documents using Cosine Similarity and Parallel Computing

Abstract: Document similarity computation is an important research topic in information retrieval, and it is a crucial issue for automatic document categorization. The similarity value is between 0 and 1, then the closest value to 1 is represented both documents is considered more relevant, vice versa. However, the large scale of textual information has created the problem of finding the relevance level between documents. Therefore, the relevance between mesh heading text in the PubMed documents is higher than the relev… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 12 publications
0
0
0
Order By: Relevance
“…After the data is weighted, the next step is the data will be equated with the cosine similarity method [46], [47]. Cosine Similarity is two vectors that have a measure of similarity in dimensional space, which is obtained from the product of the two vectors being compared.…”
Section: Cosine Similaritymentioning
confidence: 99%
“…After the data is weighted, the next step is the data will be equated with the cosine similarity method [46], [47]. Cosine Similarity is two vectors that have a measure of similarity in dimensional space, which is obtained from the product of the two vectors being compared.…”
Section: Cosine Similaritymentioning
confidence: 99%