2019
DOI: 10.3389/fgene.2019.00460
|View full text |Cite
|
Sign up to set email alerts
|

Learning Causal Biological Networks With the Principle of Mendelian Randomization

Abstract: Although large amounts of genomic data are available, it remains a challenge to reliably infer causal (i. e., regulatory) relationships among molecular phenotypes (such as gene expression), especially when multiple phenotypes are involved. We extend the interpretation of the Principle of Mendelian randomization (PMR) and present MRPC, a novel machine learning algorithm that incorporates the PMR in the PC algorithm, a classical algorithm for learning causal graphs in computer science. MRPC learns a causal biolo… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
49
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 44 publications
(53 citation statements)
references
References 45 publications
(64 reference statements)
2
49
0
Order By: Relevance
“…CAUSE is a recent extension of MR that uses genome-wide summary statistics to model causal effects while accounting for pleiotropy 20 . Another type of algorithms address gene network inference by joint analysis of genetic variants and gene expression data in order to learn a large-scale graphical model with causal links among genes [21][22][23] . Recently, Howey et al explored similar methodology as an alternative for MR 24 .…”
mentioning
confidence: 99%
“…CAUSE is a recent extension of MR that uses genome-wide summary statistics to model causal effects while accounting for pleiotropy 20 . Another type of algorithms address gene network inference by joint analysis of genetic variants and gene expression data in order to learn a large-scale graphical model with causal links among genes [21][22][23] . Recently, Howey et al explored similar methodology as an alternative for MR 24 .…”
mentioning
confidence: 99%
“…Exploratory analysis (S15 Fig) using data from Simulation Study 3 suggested that the LCV test of GCP = 0 was not well-powered in this scenario, possibly because the concept of a genetic causality proportion (GCP) was not directly encapsulated by our simulation model. We are not the first to propose the utility of genetic anchors to help orient relationships between variables in the context of network analysis [43][44][45]47]. However, to our knowledge, this is the first study to directly compare the performance of MR and BN in both real and simulated data.…”
Section: Discussionmentioning
confidence: 98%
“…This feature is particularly useful when the study aims to address simultaneous causal relationships in "omics"scale data sets, for example in studies of gene expression [41] or metabolites [42]. Recent network-building methods have been developed that allow the analysis of potentially hundreds of variables, including both discrete and continuous data types, taking advantage of the ability of genetic variables to operate as causal anchors to help orient the direction of relationships between non-genetic variables [43][44][45][46][47]. These BN approaches could be considered complementary to the MR-based approaches [28] that enable the construction of such networks, as they use very different algorithms, are more restrictive in terms of requiring individual level input data, and produce different outputs, albeit in order to achieve similar goals.…”
Section: Plos Geneticsmentioning
confidence: 99%
“…The second algorithm (MRPC), proposed by Badsha and Fu [5], was first used in genetic data. This algorithm, identically to PC, has two distinct phases.…”
Section: Bayesian Network Based Causal Discovery Algorithmsmentioning
confidence: 99%