Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40–50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes1. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel2) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10–20% (14–24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.
Polygenic Scores (PSs) describe the genetic component of an individual's quantitative phenotype or their susceptibility to diseases with a genetic basis. Currently, PSs rely on population-dependent contributions of many associated alleles, with limited applicability to understudied populations and recently admixed individuals. Here we introduce a combination of local ancestry deconvolution and partial PS computation to account for the populationspecific nature of the association signals in individuals with admixed ancestry. We demonstrate partial PS to be a proxy for the total PS and that a portion of the genome is enough to improve susceptibility predictions for the traits we test. By combining partial PSs from different populations, we are able to improve trait predictability in admixed individuals with some European ancestry. These results may extend the applicability of PSs to subjects with a complex history of admixture, where current methods cannot be applied.
Aims/hypothesis Type 2 diabetes mellitus is a major health burden disproportionately affecting those with lower educational attainment (EA). We aimed to obtain causal estimates of the association between EA and type 2 diabetes and to quantify mediating effects of known modifiable risk factors. Methods We applied two-step, two-sample multivariable Mendelian randomisation (MR) techniques using SNPs as genetic instruments for exposure and mediators, thereby minimising bias due to confounding and reverse causation. We leveraged summary data on genome-wide association studies for EA, proposed mediators (i.e. BMI, blood pressure, smoking, television watching) and type 2 diabetes. The total effect of EA on type 2 diabetes was decomposed into a direct effect and indirect effects through multiple mediators. Additionally, traditional mediation analysis was performed in a subset of the National Health and Nutrition Examination Survey 2013–2014. Results EA was inversely associated with type 2 diabetes (OR 0.53 for each 4.2 years of schooling; 95% CI 0.49, 0.56). Individually, the largest contributors were BMI (51.18% mediation; 95% CI 46.39%, 55.98%) and television watching (50.79% mediation; 95% CI 19.42%, 82.15%). Combined, the mediators explained 83.93% (95% CI 70.51%, 96.78%) of the EA–type 2 diabetes association. Traditional analysis yielded smaller effects but showed consistent direction and priority ranking of mediators. Conclusions/interpretation These results support a potentially causal protective effect of EA against type 2 diabetes, with considerable mediation by a number of modifiable risk factors. Interventions on these factors thus have the potential of substantially reducing the burden of type 2 diabetes attributable to low EA. Graphical abstract
Highlights d Ancient groups differentially contributed to complex traits in contemporary Europeans d In contemporary Estonians 11 out of 27 traits show association with some ancestry d Hunter-Gatherer and Yamnaya ancestries divergently influence cholesterol levels d Post-admixture selection is not necessary to have traitancestry associations
The spread of carbapenemase-producing Enterobacteriaceae is a global problem; however, no exact data on the epidemiology of carbapenemase in the Baltic countries and St. Petersburg area is available. We aimed to evaluate the epidemiology of carbapenemase-producing Escherichia coli and Klebsiella pneumoniae in the Baltic States and St. Petersburg, Russia, and to compare the different methods for carbapenemase detection. From January to May 2012, all K. pneumoniae (n = 1983) and E. coli (n = 7774) clinical isolates from 20 institutions in Estonia, Latvia, Lithuania, and St. Petersburg, Russia were screened for carbapenem susceptibility. The IMP, VIM, GIM, NDM, KPC, and OXA-48 genes were detected using real-time PCR and the ability to hydrolyze ertapenem was determined using MALDI-TOF MS. Seventy-seven strains were found to be carbapenem nonsusceptible. From these, 15 K. pneumoniae strains hydrolyzed ertapenem and carried the bla NDM gene. All of these strains carried integron 1 and most carried integron 3 as well as genes of the CTX-M-1 group. No carbapenemase-producing E. coli or K. pneumoniae strains were found in Estonia, Latvia, or Lithuania; however, NDM-positive K. pneumoniae was present in the hospital in St. Petersburg, Russia. A MALDI-TOF MS-based assay is a suitable and cost-effective method for the initial confirmation of carbapenemase production.
The current paradigm of personalized medicine envisages the use of genomic data to provide predictive information on the health course of an individual with the aim of prevention and individualized care. However, substantial efforts are required to realize the concept: enhanced genetic discoveries, translation into intervention strategies, and a systematic implementation in healthcare. Here we review how further genetic discoveries are improving personalized prediction and advance functional insights into the link between genetics and disease. In the second part we give our perspective on the way these advances in genomic research will transform the future of personalized prevention and medicine using Estonia as a primer.
As many cases of type 2 diabetes (T2D) are likely to remain undiagnosed, better tools for early detection of high‐risk individuals are needed to prevent or postpone the disease. We investigated the value of the doubly weighted genetic risk score (dwGRS) for the prediction of incident T2D in the Lifelines and Estonian Biobank (EstBB) cohorts. The dwGRS uses an additional weight for each single nucleotide polymorphism in the risk score, to correct for “Winner's curse” bias in the effect size estimates. The traditional (single‐weighted genetic risk score; swGRS) and dwGRS were calculated for participants in Lifelines ( n = 12,018) and EstBB ( n = 34,129). The dwGRS was found to have stronger association with incident T2D (hazard ratio [HR] = 1.26 [95% confidence interval: 1.10–1.43] and HR = 1.35 [1.28–1.42]) compared to the swGRS (HR = 1.21 [1.07–1.38] and HR = 1.25 [1.19–1.32]) in Lifelines and EstBB, respectively. Comparing the 5‐year predicted risks from the models with and without the dwGRS, the continuous net reclassification index was 0.140 (0.034–0.243; p = .009 Lifelines), and 0.257 (0.194–0.319; p < 2 × 10 −16 EstBB). The dwGRS provided incremental value to the T2D prediction model with established phenotypic predictors. It clearly distinguished the risk groups for incident T2D in both biobanks thereby showing its clinical relevance.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.