2017
DOI: 10.2196/medinform.7059
|View full text |Cite
|
Sign up to set email alerts
|

Search and Graph Database Technologies for Biomedical Semantic Indexing: Experimental Analysis

Abstract: BackgroundBiomedical semantic indexing is a very useful support tool for human curators in their efforts for indexing and cataloging the biomedical literature.ObjectiveThe aim of this study was to describe a system to automatically assign Medical Subject Headings (MeSH) to biomedical articles from MEDLINE.MethodsOur approach relies on the assumption that similar documents should be classified by similar MeSH terms. Although previous work has already exploited the document similarity by using a k-nearest neighb… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 19 publications
0
6
0
Order By: Relevance
“…Also, most research that deals with the classification problem of a large number of classes [2,[24][25][26][27][28][29] appears to rely on machine learning classifiers or deep neural networks. In our work, each thesaurus term has been moved to a vector space after training a doc2vec model with some corpus and using MeSH headings as tags.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Also, most research that deals with the classification problem of a large number of classes [2,[24][25][26][27][28][29] appears to rely on machine learning classifiers or deep neural networks. In our work, each thesaurus term has been moved to a vector space after training a doc2vec model with some corpus and using MeSH headings as tags.…”
Section: Discussionmentioning
confidence: 99%
“…Taking into account the assumption that similar documents are classified with similar MeSH terms, authors in [29] have proceeded to an implementation with an F-score of 0.69. This work starts with the conversion of documents into vectors by search engine indexing (Elastic Search) and the identification of the most similar documents based on cosine similarity.…”
Section: Mesh Indexingmentioning
confidence: 99%
“…Approaches based on k-NN have been widely used in large-scale multi-label categorization in many domains, including MEDLINE documents [37,39,40]. This preference for this lazy learning method is mainly due to its scalability, minimum parameter tuning and, despite its simplicity, its ability to deliver acceptable results when a large number of training samples are available.…”
Section: Similarity Based Categorization (K-nn)mentioning
confidence: 99%
“…Several state-of-the-art methods for MeSH indexing were introduced by teams participating in this challenge, most of them modeling the task as a multi-label learning problem [30]. Some relevant recent developments in MeSH indexing are MeSHLabeler [31], DeepMeSH [32], MeSH Now [33], AttentionMeSH [34], MeSHProbeNet [35], FullMeSH [26], BERTMeSH [36] and k-NN methods using ElasticSearch and MTI such as [37].…”
Section: Semantic Indexing In the Biomedical Domainmentioning
confidence: 99%
“…Another approach [ 25 ] for solving this issue uses the cosine similarity metric and a representation of the thesaurus as a graph database. It starts with the use of Elastic Search to convert texts into vectors, then with the cosine similarity metric it identifies the most similar texts.…”
Section: Related Workmentioning
confidence: 99%