Candidate gene and genome-wide association studies (GWAS) have identified genetic variants that modulate risk for human disease; many of these associations require further study to replicate the results. Here we report the first large-scale application of the phenome-wide association study (PheWAS) paradigm within electronic medical records (EMRs), an unbiased approach to replication and discovery that interrogates relationships between targeted genotypes and multiple phenotypes. We scanned for associations between 3,144 single-nucleotide polymorphisms (previously implicated by GWAS as mediators of human traits) and 1,358 EMR-derived phenotypes in 13,835 individuals of European ancestry. This PheWAS replicated 66% (51/77) of sufficiently powered prior GWAS associations and revealed 63 potentially pleiotropic associations with P < 4.6 × 10−6 (false discovery rate < 0.1); the strongest of these novel associations were replicated in an independent cohort (n = 7,406). These findings validate PheWAS as a tool to allow unbiased interrogation across multiple phenotypes in EMR-based cohorts and to enhance analysis of the genomic basis of human disease.
Glucose levels 2 h after an oral glucose challenge are a clinical measure of glucose tolerance used in the diagnosis of type 2 diabetes. We report a meta-analysis of nine genome-wide association studies (n = 15,234 nondiabetic individuals) and a follow-up of 29 independent loci (n = 6,958–30,620). We identify variants at the GIPR locus associated with 2-h glucose level (rs10423928, β (s.e.m.) = 0.09 (0.01) mmol/l per A allele, P = 2.0 × 10−15). The GIPR A-allele carriers also showed decreased insulin secretion (n = 22,492; insulinogenic index, P = 1.0 × 10−17; ratio of insulin to glucose area under the curve, P = 1.3 × 10−16) and diminished incretin effect (n = 804; P = 4.3 × 10−4). We also identified variants at ADCY5 (rs2877716, P = 4.2 × 10−16), VPS13C (rs17271305, P = 4.1 × 10−8), GCKR (rs1260326, P = 7.1 × 10−11) and TCF7L2 (rs7903146, P = 4.2 × 10−10) associated with 2-h glucose. Of the three newly implicated loci (GIPR, ADCY5 and VPS13C), only ADCY5 was found to be associated with type 2 diabetes in collaborating studies (n = 35,869 cases, 89,798 controls, OR = 1.12, 95% CI 1.09–1.15, P = 4.8 × 10−18).
SummaryBackgroundA high circulating concentration of interleukin 6 is associated with increased risk of coronary heart disease. Blockade of the interleukin-6 receptor (IL6R) with a monoclonal antibody (tocilizumab) licensed for treatment of rheumatoid arthritis reduces systemic and articular inflammation. However, whether IL6R blockade also reduces risk of coronary heart disease is unknown.MethodsApplying the mendelian randomisation principle, we used single nucleotide polymorphisms (SNPs) in the gene IL6R to evaluate the likely efficacy and safety of IL6R inhibition for primary prevention of coronary heart disease. We compared genetic findings with the effects of tocilizumab reported in randomised trials in patients with rheumatoid arthritis.FindingsIn 40 studies including up to 133 449 individuals, an IL6R SNP (rs7529229) marking a non-synonymous IL6R variant (rs8192284; p.Asp358Ala) was associated with increased circulating log interleukin-6 concentration (increase per allele 9·45%, 95% CI 8·34–10·57) as well as reduced C-reactive protein (decrease per allele 8·35%, 95% CI 7·31–9·38) and fibrinogen concentrations (decrease per allele 0·85%, 95% CI 0·60–1·10). This pattern of effects was consistent with IL6R blockade from infusions of tocilizumab (4–8 mg/kg every 4 weeks) in patients with rheumatoid arthritis studied in randomised trials. In 25 458 coronary heart disease cases and 100 740 controls, the IL6R rs7529229 SNP was associated with a decreased odds of coronary heart disease events (per allele odds ratio 0·95, 95% CI 0·93–0·97, p=1·53×10−5).InterpretationOn the basis of genetic evidence in human beings, IL6R signalling seems to have a causal role in development of coronary heart disease. IL6R blockade could provide a novel therapeutic approach to prevention of coronary heart disease that warrants testing in suitably powered randomised trials. Genetic studies in populations could be used more widely to help to validate and prioritise novel drug targets or to repurpose existing agents and targets for new therapeutic uses.FundingUK Medical Research Council; British Heart Foundation; Rosetrees Trust; US National Heart, Lung, and Blood Institute; Du Pont Pharma; Chest, Heart and Stroke Scotland; Wellcome Trust; Coronary Thrombosis Trust; Northwick Park Institute for Medical Research; UCLH/UCL Comprehensive Medical Research Centre; US National Institute on Aging; Academy of Finland; Netherlands Organisation for Health Research and Development; SANCO; Dutch Ministry of Public Health, Welfare and Sports; World Cancer Research Fund; Agentschap NL; European Commission; Swedish Heart-Lung Foundation; Swedish Research Council; Strategic Cardiovascular Programme of the Karolinska Institutet; Stockholm County Council; US National Institute of Neurological Disorders and Stroke; MedStar Health Research Institute; GlaxoSmithKline; Dutch Kidney Foundation; US National Institutes of Health; Netherlands Interuniversity Cardiology Institute of the Netherlands; Diabetes UK; European Union Seventh Framework Programme; Nationa...
Menopause timing has a substantial impact on infertility and risk of disease, including breast cancer, but the underlying mechanisms are poorly understood. We report a dual strategy in ~70,000 women to identify common and low-frequency protein-coding variation associated with age at natural menopause (ANM). We identified 44 regions with common variants, including two harbouring additional rare missense alleles of large effect. We found enrichment of signals in/near genes involved in delayed puberty, highlighting the first molecular links between the onset and end of reproductive lifespan. Pathway analyses revealed a major association with DNA damage-response (DDR) genes, including the first common coding variant in BRCA1 associated with any complex trait. Mendelian randomisation analyses supported a causal effect of later ANM on breast cancer risk (~6% risk increase per-year, P=3×10−14), likely mediated by prolonged sex hormone exposure, rather than DDR mechanisms.
Despite the diverse structure of the five EMRs of the eMERGE sites, we developed, validated, and successfully deployed 13 electronic phenotype algorithms. Validation is a worthwhile process that not only measures phenotype performance but also strengthens phenotype algorithm definitions and enhances their inter-institutional sharing.
We repurposed existing genotypes in DNA biobanks across the Electronic Medical Records and Genomics network to perform a genome-wide association study for primary hypothyroidism, the most common thyroid disease. Electronic selection algorithms incorporating billing codes, laboratory values, text queries, and medication records identified 1317 cases and 5053 controls of European ancestry within five electronic medical records (EMRs); the algorithms' positive predictive values were 92.4% and 98.5% for cases and controls, respectively. Four single-nucleotide polymorphisms (SNPs) in linkage disequilibrium at 9q22 near FOXE1 were associated with hypothyroidism at genome-wide significance, the strongest being rs7850258 (odds ratio [OR] 0.74, p = 3.96 × 10(-9)). This association was replicated in a set of 263 cases and 1616 controls (OR = 0.60, p = 5.7 × 10(-6)). A phenome-wide association study (PheWAS) that was performed on this locus with 13,617 individuals and more than 200,000 patient-years of billing data identified associations with additional phenotypes: thyroiditis (OR = 0.58, p = 1.4 × 10(-5)), nodular (OR = 0.76, p = 3.1 × 10(-5)) and multinodular (OR = 0.69, p = 3.9 × 10(-5)) goiters, and thyrotoxicosis (OR = 0.76, p = 1.5 × 10(-3)), but not Graves disease (OR = 1.03, p = 0.82). Thyroid cancer, previously associated with this locus, was not significantly associated in the PheWAS (OR = 1.29, p = 0.09). The strongest association in the PheWAS was hypothyroidism (OR = 0.76, p = 2.7 × 10(-13)), which had an odds ratio that was nearly identical to that of the curated case-control population in the primary analysis, providing further validation of the PheWAS method. Our findings indicate that EMR-linked genomic data could allow discovery of genes associated with many diseases without additional genotyping cost.
Objective To report the design and implementation of the Right Drug, Right Dose, Right Time: Using Genomic Data to Individualize Treatment Protocol that was developed to test the concept that prescribers can deliver genome guided therapy at the point-of-care by using preemptive pharmacogenomics (PGx) data and clinical decision support (CDS) integrated in the electronic medical record (EMR). Patients and Methods We used a multivariable prediction model to identify patients with a high risk of initiating statin therapy within 3 years. The model was used to target a study cohort most likely to benefit from preemptive PGx testing among Mayo Clinic Biobank participants with a recruitment goal of 1000 patients. Cox proportional hazards model was utilized using the variables selected through the Lasso shrinkage method. An operational CDS model was adapted to implement PGx rules within the EMR. Results The prediction model included age, sex, race, and 6 chronic diseases categorized by the Clinical Classifications Software for ICD-9 codes (dyslipidemia, diabetes, peripheral atherosclerosis, disease of the blood-forming organs, coronary atherosclerosis and other heart diseases, and hypertension). Of the 2000 Biobank participants invited, 50% provided blood samples, 13% refused, 28% did not respond, and 9% consented but did not provide a blood sample within the recruitment window (October 4, 2012 – March 20, 2013). Preemptive PGx testing included CYP2D6 genotyping and targeted sequencing of 84 PGx genes. Synchronous real-time CDS is integrated in the EMR and flags potential patient-specific drug-gene interactions and provides therapeutic guidance. Conclusion These interventions will improve understanding and implementation of genomic data in clinical practice.
Menopause timing has a substantial impact on infertility and risk of disease, including breast cancer, but the underlying mechanisms are poorly understood. We report a dual strategy in ~7 0,000 women to identify common and low-frequency protein-coding variation associated with age at natural menopause (ANM). We identified 44 regions with common variants, including two harbouring additional rare missense alleles of large effect. We found enrichment of signals in/near genes involved in delayed puberty, highlighting the first molecular links between the onset and end of reproductive lifespan. Pathway analyses revealed a major association with DNA damageresponse (DDR) genes, including the first common coding variant in BRCA1 associated with any complex trait. Mendelian randomisation analyses supported a causal effect of later ANM on breast cancer risk (~6% risk increase per-year, P=3×10 −14 ), likely mediated by prolonged sex hormone exposure, rather than DDR mechanisms.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.