2013
DOI: 10.1186/1755-8794-6-s2-s3
|View full text |Cite
|
Sign up to set email alerts
|

Compensating for literature annotation bias when predicting novel drug-disease relationships through Medical Subject Heading Over-representation Profile (MeSHOP) similarity

Abstract: BackgroundUsing annotations to the articles in MEDLINE®/PubMed®, over six thousand chemical compounds with pharmacological actions have been tracked since 1996. Medical Subject Heading Over-representation Profiles (MeSHOPs) quantitatively leverage the literature associated with biological entities such as diseases or drugs, providing the opportunity to reposition known compounds towards novel disease applications.MethodsA MeSHOP is constructed by counting the number of times each medical subject term is assign… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 9 publications
(8 citation statements)
references
References 26 publications
0
8
0
Order By: Relevance
“…Our expectation is that if the frequently seen genes are arising as candidates in more studies, and are less likely to be truly pathogenic, then they could be associated to a wider range of phenotypes in the literature (we recognize the association could also be due to pleiotropy [ 45 ], see Limitations). To analyze if FLAGS have been frequently correlated to human diseases, we used two different computational resources (MeSHOP [ 32 ], HPO [ 33 ]) to extract known significant relationship(s) between genes and human disease phenotypes based on published scientific articles. Figures 5 a and b show the distribution of the number of disease terms from HPO and MeSHOP per gene within gene sets.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Our expectation is that if the frequently seen genes are arising as candidates in more studies, and are less likely to be truly pathogenic, then they could be associated to a wider range of phenotypes in the literature (we recognize the association could also be due to pleiotropy [ 45 ], see Limitations). To analyze if FLAGS have been frequently correlated to human diseases, we used two different computational resources (MeSHOP [ 32 ], HPO [ 33 ]) to extract known significant relationship(s) between genes and human disease phenotypes based on published scientific articles. Figures 5 a and b show the distribution of the number of disease terms from HPO and MeSHOP per gene within gene sets.…”
Section: Resultsmentioning
confidence: 99%
“…We used MeSHOP software [ 32 ] to identify over-represented disease terms associated with each gene. MeSHOP returns a list of MeSH (Medical Subject Heading) terms for each gene with a p-value for each term.…”
Section: Methodsmentioning
confidence: 99%
“…Clinicians emphasized that while there are tools that offer online software applications to obtain candidate genes based upon keyword queries (e.g., MeSHOP, 34 Genie, 35 Ingenuity [ http://www.ingenuity.com ]), these capabilities are not consistently accessible to integrated WES/WGS analysis software and the output cannot be combined with exome data without additional manipulation. Expanding beyond keywords as input, clinicians further requested graphical search functionalities.…”
Section: Resultsmentioning
confidence: 99%
“…Thus, we test the effectiveness of our framework to enrich drug reposition candidates using a permutation test. We use the co-occurrence of a drug and a disease in biomedical literature and clinical trials database as a proxy of biological plausibility of the drug reposition, since it can be influenced by literature bias ( 20 , 21 ). Our study shows that 28% (14 816 out of 52 966) of the transitive drug–disease pairs from PheWAS were supported in at least one Medline abstract while permuting drug–disease pairs will only identify 6.2% of random drug–disease words in the literature.…”
Section: Database Creationmentioning
confidence: 99%
“…Furthermore, to estimate the novelty of the drug–disease pairs and to prioritize them, we cross referenced all pairs with the NIH clinical trial registry (i) and also used the biomedical literature co-occurrences of drug–disease pairs (ii) to categorize each PheWAS and GWAS generated pair in one of these four categories: ‘known/rediscovered’ for relationships that already exist in the DrugBank, ‘strongly supported’ for pairs with some evidence in both (i) and (ii), ‘Likely’ for pair with support in either (i) or (ii) and ‘novel’ for pairs with no evidence in (i) or (ii). Note that the lack of occurrence of both drug and disease terms together in the literature could potentially indicate a novel finding, a bias of literature annotations or a true lack of association ( 20 , 21 ).…”
Section: Database Creationmentioning
confidence: 99%