SummaryEducational attainment (EA) is strongly influenced by social and other environmental factors, but genetic factors are also estimated to account for at least 20% of the variation across individuals1. We report the results of a genome-wide association study (GWAS) for EA that extends our earlier discovery sample1,2 of 101,069 individuals to 293,723 individuals, and a replication in an independent sample of 111,349 individuals from the UK Biobank. We now identify 74 genome-wide significant loci associated with number of years of schooling completed. Single-nucleotide polymorphisms (SNPs) associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural development. Our findings demonstrate that, even for a behavioral phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because EA is measured in large numbers of individuals, it will continue to be useful as a proxy phenotype in efforts to characterize the genetic influences of related phenotypes, including cognition and neuropsychiatric disease.
We conducted genome-wide association studies of three phenotypes: subjective well-being (N = 298,420), depressive symptoms (N = 161,460), and neuroticism (N = 170,910). We identified three variants associated with subjective well-being, two with depressive symptoms, and eleven with neuroticism, including two inversion polymorphisms. The two depressive symptoms loci replicate in an independent depression sample. Joint analyses that exploit the high genetic correlations between the phenotypes (|ρ̂| ≈ 0.8) strengthen the overall credibility of the findings, and allow us to identify additional variants. Across our phenotypes, loci regulating expression in central nervous system and adrenal/pancreas tissues are strongly enriched for association.
Aims The aim of this study was to develop, validate, and illustrate an updated prediction model (SCORE2) to estimate 10-year fatal and non-fatal cardiovascular disease (CVD) risk in individuals without previous CVD or diabetes aged 40–69 years in Europe. Methods and results We derived risk prediction models using individual-participant data from 45 cohorts in 13 countries (677 684 individuals, 30 121 CVD events). We used sex-specific and competing risk-adjusted models, including age, smoking status, systolic blood pressure, and total- and HDL-cholesterol. We defined four risk regions in Europe according to country-specific CVD mortality, recalibrating models to each region using expected incidences and risk factor distributions. Region-specific incidence was estimated using CVD mortality and incidence data on 10 776 466 individuals. For external validation, we analysed data from 25 additional cohorts in 15 European countries (1 133 181 individuals, 43 492 CVD events). After applying the derived risk prediction models to external validation cohorts, C-indices ranged from 0.67 (0.65–0.68) to 0.81 (0.76–0.86). Predicted CVD risk varied several-fold across European regions. For example, the estimated 10-year CVD risk for a 50-year-old smoker, with a systolic blood pressure of 140 mmHg, total cholesterol of 5.5 mmol/L, and HDL-cholesterol of 1.3 mmol/L, ranged from 5.9% for men in low-risk countries to 14.0% for men in very high-risk countries, and from 4.2% for women in low-risk countries to 13.7% for women in very high-risk countries. Conclusion SCORE2—a new algorithm derived, calibrated, and validated to predict 10-year risk of first-onset CVD in European populations—enhances the identification of individuals at higher risk of developing CVD across Europe.
Background The relevance of blood lipid concentrations to long-term incidence of cardiovascular disease and the relevance of lipid-lowering therapy for cardiovascular disease outcomes is unclear. We investigated the cardiovascular disease risk associated with the full spectrum of bloodstream non-HDL cholesterol concentrations. We also created an easy-to-use tool to estimate the long-term probabilities for a cardiovascular disease event associated with non-HDL cholesterol and modelled its risk reduction by lipid-lowering treatment. Methods In this risk-evaluation and risk-modelling study, we used Multinational Cardiovascular Risk Consortium data from 19 countries across Europe, Australia, and North America. Individuals without prevalent cardiovascular disease at baseline and with robust available data on cardiovascular disease outcomes were included. The primary composite endpoint of atherosclerotic cardiovascular disease was defined as the occurrence of the coronary heart disease event or ischaemic stroke. Sex-specific multivariable analyses were computed using non-HDL cholesterol categories according to the European guideline thresholds, adjusted for age, sex, cohort, and classical modifiable cardiovascular risk factors. In a derivation and validation design, we created a tool to estimate the probabilities of a cardiovascular disease event by the age of 75 years, dependent on age, sex, and risk factors, and the associated modelled risk reduction, assuming a 50% reduction of non-HDL cholesterol. Findings Of the 524 444 individuals in the 44 cohorts in the Consortium database, we identified 398 846 individuals belonging to 38 cohorts (184 055 [48•7%] women; median age 51•0 years [IQR 40•7-59•7]). 199 415 individuals were included in the derivation cohort (91 786 [48•4%] women) and 199 431 (92 269 [49•1%] women) in the validation cohort. During a maximum follow-up of 43•6 years (median 13•5 years, IQR 7•0-20•1), 54 542 cardiovascular endpoints occurred. Incidence curve analyses showed progressively higher 30-year cardiovascular disease eventrates for increasing non-HDL cholesterol categories (from 7•7% for non-HDL cholesterol <2•6 mmol/L to 33•7% for ≥5•7 mmol/L in women and from 12•8% to 43•6% in men; p<0•0001). Multivariable adjusted Cox models with non-HDL cholesterol lower than 2•6 mmol/L as reference showed an increase in the association between non-HDL cholesterol concentration and cardiovascular disease for both sexes (from hazard ratio 1•1, 95% CI 1•0-1•3 for non-HDL cholesterol 2•6 to <3•7 mmol/L to 1•9, 1•6-2•2 for ≥5•7 mmol/L in women and from 1•1, 1•0-1•3 to 2•3, 2•0-2•5 in men). The derived tool allowed the estimation of cardiovascular disease event probabilities specific for non-HDL cholesterol with high comparability between the derivation and validation cohorts as reflected by smooth calibration curves analyses and a root mean square error lower than 1% for the estimated probabilities of cardiovascular disease. A 50% reduction of non-HDL cholesterol concentrations was associated with reduced risk of...
We conduct a genome-wide association study (GWAS) of educational attainment (EA) in a sample of ~3 million individuals and identify 3,952 approximately uncorrelated genome-wide-significant single-nucleotide polymorphisms (SNPs). A genome-wide polygenic predictor, or polygenic index (PGI), explains 12–16% of EA variance and contributes to risk prediction for ten diseases. Direct effects (i.e., controlling for parental PGIs) explain roughly half the PGI’s magnitude of association with EA and other phenotypes. The correlation between mate-pair PGIs is far too large to be consistent with phenotypic assortment alone, implying additional assortment on PGI-associated factors. In an additional GWAS of dominance deviations from the additive model, we identify no genome-wide-significant SNPs, and a separate X-chromosome additive GWAS identifies 57.
The German National Cohort (GNC) is a joint interdisciplinary endeavour of scientists from the Helmholtz and the Leibniz Association, universities, and other research institutes. Its aim is to investigate the causes for the development of major chronic diseases, i.e. cardiovascular diseases, cancer, diabetes, neurodegenerative/-psychiatric diseases, musculoskeletal diseases, respiratory and infectious diseases, and their pre-clinical stages or functional health impairments. Across Germany, a random sample of the general population will be drawn by 18 regional study centres, including a total of 100,000 women and 100,000 men aged 20–69 years. The baseline assessments include an extensive interview and self-completion questionnaires, a wide range of medical examinations and the collection of various biomaterials. In a random subgroup of 20 % of the participants (n = 40,000) an intensified examination (“Level 2”) programme will be performed. In addition, in five of the 18 study centres a total of 30,000 study participants will take part in a magnetic resonance imaging examination programme, and all of these participants will also be offered the intensified Level 2 examinations. After 4–5 years, all participants will be invited for a re-assessment. Information about chronic disease endpoints will be collected through a combination of active follow-up (including questionnaires every 2–3 years) and record linkages. The GNC is planned for an overall duration of 25–30 years. It will provide a major, central resource for population-based epidemiology in Germany, and will help to identify new and tailored strategies for early detection, prediction, and primary prevention of major diseases.
Population-based whole-body MR imaging as part of the German National Cohort will serve to compile a comprehensive image repository, will provide insight into physiologic variants and subclinical disease burden, and has the potential to enable identification of novel imaging biomarkers of risk.
Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40–50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes1. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel2) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10–20% (14–24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.