2017
DOI: 10.1093/database/bax065

Query expansion using MeSH terms for dataset retrieval: OHSU at the bioCADDIE 2016 dataset retrieval challenge

Abstract: Scientific data are being generated at an ever-increasing rate. The Biomedical and Healthcare Data Discovery Index Ecosystem (bioCADDIE) is an NIH-funded Data Discovery Index that aims to provide a platform for researchers to locate, retrieve, and share research datasets. The bioCADDIE 2016 Dataset Retrieval Challenge was held to identify the most effective dataset retrieval methods. We aimed to assess the value of Medical Subject Heading (MeSH) term-based query expansion to improve retrieval. Our system, base…

Citations: cited by 10 publications (8 citation statements)
References: 31 publications
“…In biomedical area UMLS, MeSH ( 22 ), SNOMED-CT, ICD-10, WordNet and Wikipedia are used ( 30 ). Generally, the result of lexicon type expansion is positive (in the bioCADDIE contest see for example ( 19 , 20 )). We did not use this method in our work because of lack of access to MeSH medical text indexer service.…”
Section: Results (mentioning)
confidence: 99%
“…Additional runs determined the optimal number of MeSH terms and weighting. Their best overall score used five MeSH terms with a 1:5 terms: words weighting ratio ( 19 ). This is the same ratio we used in our best run when query expanded terms are derived from word2vec.…”
Section: Related Work (mentioning)
confidence: 99%
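
The weighting described in the statement above (five MeSH terms, 1:5 terms:words ratio) can be illustrated with a minimal sketch. It assumes the ratio means each MeSH term gets weight 1 for every 5 units of weight on an original query word; the cited text does not say how the weights were applied in the search engine, so the function `expand_query`, its parameters, and the sample terms are all hypothetical and not the OHSU system's actual implementation.

```python
from typing import Dict, List


def expand_query(words: List[str],
                 mesh_terms: List[str],
                 max_terms: int = 5,
                 term_weight: float = 1.0,
                 word_weight: float = 5.0) -> Dict[str, float]:
    """Build a token -> weight map for a weighted query.

    Hypothetical helper: original query words receive `word_weight`,
    and at most `max_terms` MeSH terms receive `term_weight`
    (an assumed reading of the 1:5 terms:words ratio).
    """
    weights: Dict[str, float] = {}
    # Original query words carry the larger weight.
    for word in words:
        key = word.lower()
        weights[key] = weights.get(key, 0.0) + word_weight
    # Only the first `max_terms` MeSH terms are added, at the smaller weight.
    for term in mesh_terms[:max_terms]:
        key = term.lower()
        weights[key] = weights.get(key, 0.0) + term_weight
    return weights


if __name__ == "__main__":
    # Sample input is illustrative only; the MeSH terms would come from
    # whatever term-suggestion step precedes expansion.
    query = expand_query(
        ["breast", "cancer"],
        ["Breast Neoplasms", "Carcinoma", "Mammography"],
    )
    for token, weight in query.items():
        print(f"{token}: {weight}")
```

The resulting map could be passed to any retrieval engine that supports per-term boosts; how the boosts interact with the ranking function depends on that engine.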
“…Therefore, the query expansion method is introduced into the QA model, which makes up the semantic gap between questions and answers by adding words related to the answers to the original query. In the field of medical, external medical knowledge resources such as MeSH [9], UMLS [10], and several medical ontology databases [11] are employed as the source of extension words. However, the query expansion only based on synonyms is incapable of accurately capturing the semantic information in the corpus.…”
Section: Question Answering Based On Query Expansion (mentioning)
confidence: 99%
“…In addition to that biomedical and healthCAre Data Discovery Index Ecosystem (bioCADDIE) dataset retrieval challenge was organized in 2016 to evaluate the effectiveness of information retrieval (IR) techniques in identifying relevant biomedical datasets in DataMed ( 3 ). Among the teams participated in this shared task, use of probabilistic or machine learning based IR ( 4 ), medical subject headings (MeSH) term based query expansion ( 5 ), word embeddings and identifying named entity ( 6 ), and re-ranking ( 7 ) for searching datasets using a query were the prevalent approaches. Similarly, a specialized search engine named Omicseq was developed for retrieving omics data ( 8 ).…”
Section: Introduction (mentioning)
confidence: 99%