Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40–50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes1. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel2) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10–20% (14–24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.
Coronary artery disease (CAD) is a leading cause of death, yet its genetic determinants are not fully elucidated. We report a multi-ethnic genome-wide association study of CAD involving nearly a quarter of a million cases, incorporating the largest cohorts to date of Whites, Blacks, and Hispanics from the Million Veteran Program with existing studies including CARDIoGRAMplusC4D, UK Biobank, and Biobank Japan. We verify substantial and nearly equivalent heritability of CAD across multiple ancestral groups, discover 107 novel loci including the first nine on the X-chromosome, identify the first eight genome-wide significant loci among Blacks and Hispanics, and demonstrate that two common haplotypes are largely responsible for the risk stratification at the well-known 9p21 locus in most populations except those of African origin where both haplotypes are virtually absent. We identify 15 loci for angiographically derived burden of coronary atherosclerosis, which robustly overlap with the strongest and earliest loci reported to date for clinical CAD. Phenome-wide association analyses of novel loci and externally validated polygenic risk scores (PRS) augment signals from the insulin resistance cluster of risk factors and consequences, extend previously established pleiotropic associations of loci with traditional risk factors to include smoking and family history, and confirm a substantially reduced transferability of existing PRS to Blacks. Downstream integrative genomic analyses reinforce the critical role of endothelial, fibroblast, and smooth muscle cells within the coronary vessel wall in CAD susceptibility. Our study highlights the value of a multi-ethnic design in efficiently characterizing the genetic architecture of CAD across all human populations.
Background: Abdominal aortic aneurysm (AAA) is an important cause of cardiovascular mortality; however, its genetic determinants remain incompletely defined. In total, 10 previously identified risk loci explain a small fraction of AAA heritability. Methods: We performed a genome-wide association study in the Million Veteran Program testing ≈18 million DNA sequence variants with AAA (7642 cases and 172 172 controls) in veterans of European ancestry with independent replication in up to 4972 cases and 99 858 controls. We then used mendelian randomization to examine the causal effects of blood pressure on AAA. We examined the association of AAA risk variants with aneurysms in the lower extremity, cerebral, and iliac arterial beds, and derived a genome-wide polygenic risk score (PRS) to identify a subset of the population at greater risk for disease. Results: Through a genome-wide association study, we identified 14 novel loci, bringing the total number of known significant AAA loci to 24. In our mendelian randomization analysis, we demonstrate that a genetic increase of 10 mm Hg in diastolic blood pressure (odds ratio, 1.43 [95% CI, 1.24–1.66]; P =1.6×10 −6 ), as opposed to systolic blood pressure (odds ratio, 1.06 [95% CI, 0.97–1.15]; P =0.2), likely has a causal relationship with AAA development. We observed that 19 of 24 AAA risk variants associate with aneurysms in at least 1 other vascular territory. A 29-variant PRS was strongly associated with AAA (odds ratio PRS , 1.26 [95% CI, 1.18–1.36]; P PRS =2.7×10 −11 per SD increase in PRS), independent of family history and smoking risk factors (odds ratio PRS+family history+smoking , 1.24 [95% CI, 1.14–1.35]; P PRS =1.27×10 −6 ). Using this PRS, we identified a subset of the population with AAA prevalence greater than that observed in screening trials informing current guidelines. Conclusions: We identify novel AAA genetic associations with therapeutic implications and identify a subset of the population at significantly increased genetic risk of AAA independent of family history. Our data suggest that extending current screening guidelines to include testing to identify those with high polygenic AAA risk, once the cost of genotyping becomes comparable with that of screening ultrasound, would significantly increase the yield of current screening at reasonable cost.
Spontaneous coronary artery dissection (SCAD) is a non-atherosclerotic cause of myocardial infarction (MI), typically in young women. We undertook a genome-wide association study of SCAD (N cases = 270/N controls = 5,263) and identified and replicated an association of rs12740679 at chromosome 1q21.2 (P discovery+replication = 2.19 × 10 −12 , OR = 1.8) influencing ADAMTSL4 expression. Meta-analysis of discovery and replication samples identified associations with P < 5 × 10 −8 at chromosome 6p24.1 in PHACTR1, chromosome 12q13.3 in LRP1, and in females-only, at chromosome 21q22.11 near LINC00310. A polygenic risk score for SCAD was associated with (1) higher risk of SCAD in individuals with fibromuscular dysplasia (P = 0.021, OR = 1.82 [95% CI: 1.09-3.02]) and (2) lower risk of atherosclerotic coronary artery disease and MI in the UK Biobank (P = 1.28 × 10 −17 , HR = 0.91 [95% CI :0.89-0.93], for MI) and Million Veteran Program (P = 9.33 × 10 −36 , OR = 0.95 [95% CI: 0.94-0.96], for CAD; P = 3.35 × 10 −6 , OR = 0.96 [95% CI: 0.95-0.98] for MI). Here we report that SCADrelated MI and atherosclerotic MI exist at opposite ends of a genetic risk spectrum, inciting MI with disparate underlying vascular biology.
We investigated type 2 diabetes (T2D) genetic susceptibility in a multi-ethnic meta-analysis of 228,499 cases and 1,178,783 controls in the Million Veteran Program (MVP) and other biobanks. We identified 558 autosomal and 10 X-chromosome T2D-associated variants, of which 286 autosomal and 7 X-chromosome variants were previously unreported. Ancestry-specific analyses identified 25 additional novel T2D-susceptibility variants. Transcriptome-wide association analysis detected 3,568 T2D-associations with T2D-colocalized genetically predicted gene expression of 804 genes in 52 tissues, of which 687 are novel. Fifty-four of these genes are known to interact with FDA-approved drugs and chemical compounds. T2D polygenic risk score was strongly associated with increased the risk of T2D-related retinopathy, and additionally showed evidence for association with chronic kidney disease (CKD), neuropathy, and peripheral artery disease (PAD). We investigated the genetic etiology of T2D-related vascular outcomes in the MVP and observed statistical SNP-T2D interactions at 13 variants, including 3 for coronary heart disease, 1 for PAD, 2 for stroke, 4 for retinopathy, 2 for CKD, and 1 for neuropathy. Our findings may identify potential novel therapeutic targets for T2D and genomic pathways that link T2D and its vascular outcomes.
Background: Although lower-complexity cardiac malformations constitute the majority of adult congenital heart disease (ACHD), the long-term risks of adverse cardiovascular events and relationship with conventional risk factors in this population are poorly understood. We aimed to quantify the risk of adverse cardiovascular events associated with lower-complexity ACHD that is unmeasured by conventional risk factors. Methods: A multi-tiered classification algorithm was used to select individuals with lowercomplexity ACHD and individuals without ACHD for comparison amongst >500,000 British adults in the UK Biobank (UKB). ACHD diagnoses were sub-classified as "isolated aortic valve (AoV)" and "non-complex" defects. Time-to-event analyses were conducted for primary endpoints of fatal or non-fatal acute coronary syndrome (ACS), ischemic stroke, heart failure (HF), and atrial fibrillation, and a secondary combined endpoint for major adverse cardiovascular event (MACE).
Identification of individuals at highest risk of coronary artery disease (CAD)—ideally before onset—remains an important public health need. Prior studies have developed genome-wide polygenic scores to enable risk stratification, reflecting the substantial inherited component to CAD risk. Here we develop a new and significantly improved polygenic score for CAD, termed GPSMult, that incorporates genome-wide association data across five ancestries for CAD (>269,000 cases and >1,178,000 controls) and ten CAD risk factors. GPSMult strongly associated with prevalent CAD (odds ratio per standard deviation 2.14, 95% confidence interval 2.10–2.19, P < 0.001) in UK Biobank participants of European ancestry, identifying 20.0% of the population with 3-fold increased risk and conversely 13.9% with 3-fold decreased risk as compared with those in the middle quintile. GPSMult was also associated with incident CAD events (hazard ratio per standard deviation 1.73, 95% confidence interval 1.70–1.76, P < 0.001), identifying 3% of healthy individuals with risk of future CAD events equivalent to those with existing disease and significantly improving risk discrimination and reclassification. Across multiethnic, external validation datasets inclusive of 33,096, 124,467, 16,433 and 16,874 participants of African, European, Hispanic and South Asian ancestry, respectively, GPSMult demonstrated increased strength of associations across all ancestries and outperformed all available previously published CAD polygenic scores. These data contribute a new GPSMult for CAD to the field and provide a generalizable framework for how large-scale integration of genetic association data for CAD and related traits from diverse populations can meaningfully improve polygenic risk prediction.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.