Background: High-dimensional omics datasets provide valuable resources to determine the causal role of molecular traits in mediating the path from genotype to phenotype. Making use of quantitative trait loci (QTL) and genome-wide association studies (GWASs) summary statistics, we developed a multivariable Mendelian randomization (MVMR) framework to quantify the connectivity between three omics layers (DNA methylome (DNAm), transcriptome and proteome) and their cascading causal impact on complex traits and diseases.
Results: Evaluating 50 complex traits, we found that on average 37.8% (95% CI: [36.0%-39.5%]) of DNAm-to-trait effects were mediated through transcripts in the cis-region, while only 15.8% (95% CI: [11.9%-19.6%]) are mediated through proteins in cis. DNAm sites typically regulate multiple transcripts, and while found to predominantly decrease gene expression, this was only the case for 53.4% across ~47,000 significant DNAm-transcript pairs. The average mediation proportion for transcript-to-trait effects through proteins (encoded for by the assessed transcript or located in trans) was estimated to be 5.27% (95%CI: [4.11%-6.43%]). Notable differences in the transcript and protein QTL architectures were detected with only 22% of protein levels being causally driven by their corresponding transcript levels. Several regulatory mechanisms were hypothesized including an example where cg10385390 (chr1:8,022,505) increases the risk of irritable bowel disease by reducing PARK7 transcript and protein expression.
Conclusions: The proposed integrative framework identified putative causal chains through omics layers providing a powerful tool to map GWAS signals. Quantification of causal effects between successive layers indicated that molecular mechanisms can be more complex than what the central dogma of biology would suggest.