Results from genome-wide association studies (GWAS) can be used to infer causal relationships between phenotypes, using a strategy known as 2-sample Mendelian randomization (2SMR) and bypassing the need for individual-level data. However, 2SMR methods are evolving rapidly and GWAS results are often insufficiently curated, undermining efficient implementation of the approach. We therefore developed MR-Base (http://www.mrbase.org): a platform that integrates a curated database of complete GWAS results (no restrictions according to statistical significance) with an application programming interface, web app and R packages that automate 2SMR. The software includes several sensitivity analyses for assessing the impact of horizontal pleiotropy and other violations of assumptions. The database currently comprises 11 billion single nucleotide polymorphism-trait associations from 1673 GWAS and is updated on a regular basis. Integrating data with software ensures more rigorous application of hypothesis-driven analyses and allows millions of potential causal relationships to be efficiently evaluated in phenome-wide association studies.
Purpose of ReviewMendelian randomization (MR) is a strategy for evaluating causality in observational epidemiological studies. MR exploits the fact that genotypes are not generally susceptible to reverse causation and confounding, due to their fixed nature and Mendel’s First and Second Laws of Inheritance. MR has the potential to provide information on causality in many situations where randomized controlled trials are not possible, but the results of MR studies must be interpreted carefully to avoid drawing erroneous conclusions.Recent FindingsIn this review, we outline the principles behind MR, as well as assumptions and limitations of the method. Extensions to the basic approach are discussed, including two-sample MR, bidirectional MR, two-step MR, multivariable MR, and factorial MR. We also consider some new applications and recent developments in the methodology, including its ability to inform drug development, automation of the method using tools such as MR-Base, and phenome-wide and hypothesis-free MR.SummaryIn conjunction with the growing availability of large-scale genomic databases, higher level of automation and increased robustness of the methods, MR promises to be a valuable strategy to examine causality in complex biological/omics networks, inform drug development and prioritize intervention targets for disease prevention in the future.
The human proteome is a major source of therapeutic targets. Recent genetic association analyses of the plasma proteome enable systematic evaluation of the causal consequences of variation in plasma protein levels. Here we estimated the effects of 1,002 proteins on 225 phenotypes using two-sample Mendelian randomization (MR) and colocalization. Of 413 associations supported by evidence from MR, 130 (31.5%) were not supported by results of colocalization analyses, suggesting that genetic confounding due to linkage disequilibrium (LD) is widespread in naïve phenome-wide association studies of proteins. Combining MR and colocalization evidence in cis-only analyses, we identified 111 putatively causal effects between 65 proteins and 52 disease-related phenotypes ( www.epigraphdb.org/pqtl/ ). Evaluation of data from historic drug development programs showed that target-indication pairs with MR and colocalization support were more likely to be approved, evidencing the value of this approach in identifying and prioritizing potential therapeutic targets.
Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disease with a life-time risk of 1 in 350 people and an unmet need for disease-modifying therapies. We conducted a cross-ancestry GWAS in ALS including 29,612 ALS patients and 122,656 controls which identified 15 risk loci in ALS. When combined with 8,953 whole-genome sequenced individuals (6,538 ALS patients, 2,415 controls) and the largest cortex-derived eQTL dataset (MetaBrain), analyses revealed locus-specific genetic architectures in which we prioritized genes either through rare variants, repeat expansions or regulatory effects. ALS associated risk loci were shared with multiple traits within the neurodegenerative spectrum, but with distinct enrichment patterns across brain regions and cell-types. Of the environmental and life-style risk factors obtained from literature, Mendelian randomization analyses indicated a causal role for high cholesterol levels. All ALS associated signals combined reveal a role for perturbations in vesicle mediated transport and autophagy, and provide evidence for cell-autonomous disease initiation in glutamatergic neurons.
Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disease with a lifetime risk of one in 350 people and an unmet need for disease-modifying therapies. We conducted a cross-ancestry genome-wide association study (GWAS) including 29,612 patients with ALS and 122,656 controls, which identified 15 risk loci. When combined with 8,953 individuals with whole-genome sequencing (6,538 patients, 2,415 controls) and a large cortex-derived expression quantitative trait locus (eQTL) dataset (MetaBrain), analyses revealed locus-specific genetic architectures in which we prioritized genes either through rare variants, short tandem repeats or regulatory effects. ALS-associated risk loci were shared with multiple traits within the neurodegenerative spectrum but with distinct enrichment patterns across brain regions and cell types. Of the environmental and lifestyle risk factors obtained from the literature, Mendelian randomization analyses indicated a causal role for high cholesterol levels. The combination of all ALS-associated signals reveals a role for perturbations in vesicle-mediated transport and autophagy and provides evidence for cell-autonomous disease initiation in glutamatergic neurons.
The human proteome is a major source of therapeutic targets. Recent genetic association analyses of the plasma proteome enable systematic evaluation of the causal consequences of variation in plasma protein levels. Here, we estimated the effects of 1002 proteins on 225 phenotypes using two-sample Mendelian randomization (MR) and colocalization. Of 413 associations supported by evidence from MR, 130 (31.5%) were not supported by results of colocalization analyses, suggesting that genetic confounding due to linkage disequilibrium (LD) is widespread in naive phenome-wide association studies of proteins. Combining MR and colocalization evidence in cis-only analyses, we identified 111 putatively causal effects between 65 proteins and 52 disease-related phenotypes (www.epigraphdb.org/pqtl/). Evaluation of data from historic drug development programmes showed that target-indication pairs with MR and colocalization support were more likely to be approved, evidencing the value of our approach in identifying and prioritising potential therapeutic targets.
Gaining insight into the downstream consequences of non-coding variants is an essential step towards the identification of therapeutic targets from genome-wide association study (GWAS) findings. Here we have harmonized and integrated 8,727 RNA-seq samples with accompanying genotype data from multiple brain-regions from 14 datasets. This sample size enabled us to perform both cis- and trans-expression quantitative locus (eQTL) mapping. Upon comparing the brain cortex cis-eQTLs (for 12,307 unique genes at FDR<0.05) with a large blood cis-eQTL analysis (n=31,684 samples), we observed that brain eQTLs are more tissue specific than previously assumed. We inferred the brain cell type for 1,515 cis-eQTLs by using cell type proportion information. We conducted Mendelian Randomization on 31 brain-related traits using cis-eQTLs as instruments and found 159 significant findings that also passed colocalization. Furthermore, two multiple sclerosis (MS) findings had cell type specific signals, a neuron-specific cis-eQTL for CYP24A1 and a macrophage specific cis-eQTL for CLECL1. To further interpret GWAS hits, we performed trans-eQTL analysis. We identified 2,589 trans-eQTLs (at FDR<0.05) for 373 unique SNPs, affecting 1,263 unique genes, and 21 replicated significantly using single-nucleus RNA-seq data from excitatory neurons. We also generated a brain-specific gene-coregulation network that we used to predict which genes have brain-specific functions, and to perform a novel network analysis of Alzheimer's disease (AD), amyotrophic lateral sclerosis (ALS), multiple sclerosis (MS) and Parkinson's disease (PD) GWAS data. This resulted in the identification of distinct sets of genes that show significantly enriched co-regulation with genes inside the associated GWAS loci, and which might reflect drivers of these diseases.
We aimed to report the first genomewide association study (GWAS) meta‐analysis of dual‐energy X‐ray absorptiometry (DXA)‐derived hip shape, which is thought to be related to the risk of both hip osteoarthritis and hip fracture. Ten hip shape modes (HSMs) were derived by statistical shape modeling using SHAPE software, from hip DXA scans in the Avon Longitudinal Study of Parents and Children (ALSPAC; adult females), TwinsUK (mixed sex), Framingham Osteoporosis Study (FOS; mixed), Osteoporotic Fractures in Men study (MrOS), and Study of Osteoporotic Fractures (SOF; females) (total N = 15,934). Associations were adjusted for age, sex, and ancestry. Five genomewide significant ( p < 5 × 10 −9 , adjusted for 10 independent outcomes) single‐nucleotide polymorphisms (SNPs) were associated with HSM1, and three SNPs with HSM2. One SNP, in high linkage disequilibrium with rs2158915 associated with HSM1, was associated with HSM5 at genomewide significance. In a look‐up of previous GWASs, three of the identified SNPs were associated with hip osteoarthritis, one with hip fracture, and five with height. Seven SNPs were within 200 kb of genes involved in endochondral bone formation, namely SOX9 , PTHrP , RUNX1 , NKX3‐2 , FGFR4 , DICER1 , and HHIP . The SNP adjacent to DICER1 also showed osteoblast cis‐regulatory activity of GSC , in which mutations have previously been reported to cause hip dysplasia. For three of the lead SNPs, SNPs in high LD ( r 2 > 0.5) were identified, which intersected with open chromatin sites as detected by ATAC‐seq performed on embryonic mouse proximal femora. In conclusion, we identified eight SNPs independently associated with hip shape, most of which were associated with height and/or mapped close to endochondral bone formation genes, consistent with a contribution of processes involved in limb growth to hip shape and pathological sequelae. These findings raise the possibility that genetic studies of hip shape might help in understanding potential pathways involved in hip osteoarthritis and hip fracture. © 2018 The Authors. Journal of Bone and Mineral Research Published by Wiley Periodicals, Inc.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.