Biocomputing 2017 2016
DOI: 10.1142/9789813207813_0015
|View full text |Cite
|
Sign up to set email alerts
|

Empowering Multi-Cohort Gene Expression Analysis to Increase Reproducibility

Abstract: A major contributor to the scientific reproducibility crisis has been that the results from homogeneous, single-center studies do not generalize to heterogeneous, real world populations. Multi-cohort gene expression analysis has helped to increase reproducibility by aggregating data from diverse populations into a single analysis. To make the multi-cohort analysis process more feasible, we have assembled an analysis pipeline which implements rigorously studied meta-analysis best practices. We have compiled and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
104
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
7
2

Relationship

4
5

Authors

Journals

citations
Cited by 89 publications
(107 citation statements)
references
References 29 publications
3
104
0
Order By: Relevance
“…Then, for each disease, the we applied a multi-cohort analysis framework to identify disease gene signatures. This framework 28 has been shown to identify reproducible gene signatures 29 across multiple independent cohorts in different disease conditions, including bacterial and viral infections, organ transplantation, and cancer for identifying diagnostic and prognostic disease signatures, novel drug targets and repurposing FDA-approved drugs 26 , 29 – 34 . Next, we performed a GO enrichment analysis via traditional over-representation statistical methods for each disease gene signature, producing a set of enriched GO terms.…”
Section: Resultsmentioning
confidence: 99%
“…Then, for each disease, the we applied a multi-cohort analysis framework to identify disease gene signatures. This framework 28 has been shown to identify reproducible gene signatures 29 across multiple independent cohorts in different disease conditions, including bacterial and viral infections, organ transplantation, and cancer for identifying diagnostic and prognostic disease signatures, novel drug targets and repurposing FDA-approved drugs 26 , 29 – 34 . Next, we performed a GO enrichment analysis via traditional over-representation statistical methods for each disease gene signature, producing a set of enriched GO terms.…”
Section: Resultsmentioning
confidence: 99%
“…Having access to detailed phenotypic data can improve the models and allow us to explore other relevant questions in association with the microbial genera. Finally, the clinical definition of PTB may be different depending on the exact obstetrical definition that was used in the individual studies, however, the majority of the cohorts we investigated focused on spontaneous PTB and the signals that are robust despite the potential patient heterogeneity are more likely to be real as we have seen from prior gene expression meta-analysis studies (Dudley et al, 2009;Haynes et al, 2017;Sweeney et al, 2017). As more data becomes publicly available, we hope that additional standardization and meta-data availability will help address some of the aforementioned issues.…”
Section: Discussionmentioning
confidence: 99%
“…Given that aberrant IFN signalling has been implicated in a broad range of infectious or autoimmune diseases, we further asked whether the IFN interactome could be used to interpret existing molecular datasets relevant to human pathologies. We drew on a resource of multi-cohort gene expression meta-analyses for 103 diseases [36,37] to identify genes with reproducible evidence of differential expression in eleven diseases characterized by an elevated IFN transcriptional signature [8] , including viral infections, systemic and discoid lupus erythematosus, rheumatoid arthritis, sarcoidosis, and Sjogren's syndrome. We mapped protein-protein interactions for dozens to hundreds of differentially expressed genes from each disease ( Figure 3I); notably, interactions for many such gene products were identified exclusively in the IFN-stimulated condition.…”
Section: Biological Relevance Of the Ifn Interactomementioning
confidence: 99%