2017
DOI: 10.1371/journal.pcbi.1005662
|View full text |Cite
|
Sign up to set email alerts
|

Learning causal networks with latent variables from multivariate information in genomic data

Abstract: Learning causal networks from large-scale genomic data remains challenging in absence of time series or controlled perturbation experiments. We report an information- theoretic method which learns a large class of causal or non-causal graphical models from purely observational data, while including the effects of unobserved latent variables, commonly found in many genomic datasets. Starting from a complete graph, the method iteratively removes dispensable edges, by uncovering significant information contributi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
56
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
3
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 35 publications
(56 citation statements)
references
References 68 publications
0
56
0
Order By: Relevance
“…To distinguish between direct and indirect interactions, several network construction tools use a probabilistic graphical model Kurtz et al (2015); Yang et al (2017), e.g. SPIEC- EASI Kurtz et al (2019, miic Verny et al (2017), or FlashWeave Tackmann et al (2019. FlashWeave can also integrate metadata to remove indirect associations driven by environmental factors.…”
Section: Removing Indirect Dependencies Including Environmental Effectsmentioning
confidence: 99%
“…To distinguish between direct and indirect interactions, several network construction tools use a probabilistic graphical model Kurtz et al (2015); Yang et al (2017), e.g. SPIEC- EASI Kurtz et al (2019, miic Verny et al (2017), or FlashWeave Tackmann et al (2019. FlashWeave can also integrate metadata to remove indirect associations driven by environmental factors.…”
Section: Removing Indirect Dependencies Including Environmental Effectsmentioning
confidence: 99%
“…As the Set 3 genes specified the perivascular population corresponding to the major cell cluster A we took advantage of the large number of Leptin receptor-positive (LEPR) single cells to refine the V gene network. To identify direct paths between genes including causal relationships and inferring latent common regulators of expressed genes we applied the multivariate information-based inductive causation algorithm (miic) (Sella et al, 2018;Verny et al, 2017), submitting to it the matrix consisting of 1,712 observations and 109 variables (the Set 3 genes). As expected only a fraction of nodes (52) defining direct high-confidence paths were retained ( Figure 4C).…”
Section: Single-cell-level Analysis Identifies High-confidence Pathsmentioning
confidence: 99%
“…We applied a discriminant analysis pipeline to identify the genes predominantly expressed by the different cell sets and analyzed the different cell populations to unravel the gene relatedness (Horvath, 2011;Langfelder and Horvath, 2008). Moreover, we took advantage of the numerous observations provided by recently published single-cell transcriptomes (Tikhonova et al, 2019) to disclose in the perivascular network the direct gene-to-gene interactions and reveal oriented links based on the signature of causality (Sella et al, 2018;Verny et al, 2017). Finally, as proof of concept, we studied the protein expression and activity of the top most discriminant and connected gene, the Wnt signaling pathway facilitator R-spondin 2 (Rspo2), and validated that it was expressed by a small set of perivascular cells, and acted in concert with the Stem Cell Factor (SCF) (also known as Kit Ligand) to amplify ex vivo hematopoietic precursors.…”
Section: Introductionmentioning
confidence: 99%
“…Gene ontology analysis was done using the human genome as background (Table S1). Functional gene network reconstruction was achieved using the information-theoretic method, MIIC (multivariate information-based inductive causation, Supplementary Information 1 [58,65]). Four independent datasets comprising 153 to 485 primary GBM transcriptomes were used for patient survival analysis (Table S1).…”
Section: Computational Analysesmentioning
confidence: 99%