Clinical laboratory tests are a critical component of the continuum of care. We evaluate the genetic basis of 35 blood and urine laboratory measurements in the UK Biobank (n=363,228 individuals). We identify 1,857 loci associated with at least one trait, containing 3,374 fine-mapped associations, and additional sets of large-effect (> 0.1 sd) protein-altering, HLA, and copy-number variant associations. Through Mendelian Randomization analysis, we discover 51 causal relationships, including previously known agonistic effects of urate on gout and cystatin C on stroke. Finally, we develop polygenic risk scores for each biomarker and built ‘multi-PRS’ models for diseases using 35 PRSs simultaneously, which improved chronic kidney disease, type 2 diabetes, gout, and alcoholic cirrhosis genetic risk stratification in an independent dataset (FinnGen; n=135,500) relative to single-disease PRSs. Together, our results delineate the genetic basis of biomarkers, their causal influences on diseases, and improve genetic risk stratification for common diseases.
Polygenic risk scores (PRS) have shown promise in predicting susceptibility to common diseases. 1-3 We estimated their added value in clinical risk prediction of five common diseases, using largescale biobank data (FinnGen; N=135,300), and the FINRISK study with clinical risk factors to test genome-wide PRSs for coronary heart disease (CHD), type 2 diabetes (T2D), atrial fibrillation (AF), and breast and prostate cancer. We evaluated the lifetime risk at different PRS levels, and the impact on disease onset and on prediction together with clinical risk scores. Compared to average PRS, having a high PRS contributed to 21% to 38% higher lifetime risk, and 4 to 9 years earlier disease onset. PRS improved model discrimination over age and sex in T2D, AF, breast cancer, and prostate cancer, and over clinical risk in T2D, breast cancer, and prostate cancer. In all diseases, PRS improved reclassification over clinical thresholds, with largest net reclassification improvements for early-onset CHD, AF, and prostate cancer. This study provides evidence for the additional value of PRS in clinical disease prediction. The practical applications of polygenic risk information for stratified screening or for guiding lifestyle and medical interventions in the clinical setting remain to be defined in further studies.
Exome sequencing studies have generally been underpowered to identify deleterious alleles with a large effect on complex traits, as such alleles are mostly rare. Because the population of northern and eastern Finland has expanded dramatically and in isolation following a series of bottlenecks, it harbors numerous deleterious alleles at relatively high frequency. Capitalizing on this circumstance, we exome sequenced nearly 20,000 individuals from these regions. Exome-wide association studies for 64 quantitative traits clinically relevant to cardiovascular and metabolic disease identified 26 newly associated deleterious alleles. Nineteen of these alleles are either unique to or >20 times more frequent in Finns than in other Europeans and show geographical clustering comparable to Mendelian disease mutations characteristic of the Finnish population. We estimate that sequencing studies in populations without this unique history would require hundreds of thousands to millions of participants to achieve comparable association power.
Understanding genetic architecture of plasma lipidome could provide better insights into lipid metabolism and its link to cardiovascular diseases (CVDs). Here, we perform genome-wide association analyses of 141 lipid species (n = 2,181 individuals), followed by phenome-wide scans with 25 CVD related phenotypes (n = 511,700 individuals). We identify 35 lipid-species-associated loci (P <5 ×10−8), 10 of which associate with CVD risk including five new loci-COL5A1, GLTPD2, SPTLC3, MBOAT7 and GALNT16 (false discovery rate<0.05). We identify loci for lipid species that are shown to predict CVD e.g., SPTLC3 for CER(d18:1/24:1). We show that lipoprotein lipase (LPL) may more efficiently hydrolyze medium length triacylglycerides (TAGs) than others. Polyunsaturated lipids have highest heritability and genetic correlations, suggesting considerable genetic regulation at fatty acids levels. We find low genetic correlations between traditional lipids and lipid species. Our results show that lipidomic profiles capture information beyond traditional lipids and identify genetic variants modifying lipid levels and risk of CVD.
SummaryBiobanks are being established across the world to understand the genetic, environmental, and epidemiological basis of human diseases with the goal of better prevention and treatments. Genome-wide association studies (GWAS) have been very successful at mapping genomic loci for a wide range of human diseases and traits, but in general, lack appropriate representation of diverse ancestries - with most biobanks and preceding GWAS studies composed of individuals of European ancestries. Here, we introduce the Global Biobank Meta-analysis Initiative (GBMI) -- a collaborative network of 19 biobanks from 4 continents representing more than 2.1 million consented individuals with genetic data linked to electronic health records. GBMI meta-analyzes summary statistics from GWAS generated using harmonized genotypes and phenotypes from member biobanks. GBMI brings together results from GWAS analysis across 6 main ancestry groups: approximately 33,000 of African ancestry either from Africa or from admixed-ancestry diaspora (AFR), 18,000 admixed American (AMR), 31,000 Central and South Asian (CSA), 341,000 East Asian (EAS), 1.4 million European (EUR), and 1,600 Middle Eastern (MID) individuals. In this flagship project, we generated GWASs from across 14 exemplar diseases and endpoints, including both common and less prevalent diseases that were previously understudied. Using the genetic association results, we validate that GWASs conducted in biobanks worldwide can be successfully integrated despite heterogeneity in case definitions, recruitment strategies, and baseline characteristics between biobanks. We demonstrate the value of this collaborative effort to improve GWAS power for diseases, increase representation, benefit understudied diseases, and improve risk prediction while also enabling the nomination of disease genes and drug candidates by incorporating gene and protein expression data and providing insight into the underlying biology of the studied traits.
Despite the great success of genome-wide association studies (GWAS) in identifying genetic loci significantly associated with diseases, the vast majority of causal variants underlying disease-associated loci have not been identified. To create an atlas of causal variants, we performed and integrated fine-mapping across 148 complex traits in three large-scale biobanks (BioBank Japan, FinnGen, and UK Biobank; total n = 811,261), resulting in 4,518 variant-trait pairs with high posterior probability (> 0.9) of causality. Of these, we found 285 high-confidence variant-trait pairs replicated across multiple populations, and we characterized multiple contributors to the surprising lack of overlap among fine-mapping results from different biobanks. By studying the bottlenecked Finnish and Japanese populations, we identified 21 and 26 putative causal coding variants with extreme allele frequency enrichment (> 10-fold) in these two populations, respectively. Aggregating data across populations enabled identification of 1,492 unique fine-mapped coding variants and 176 genes in which multiple independent coding variants influence the same trait (i.e., with an allelic series of coding variants). Our results demonstrate that fine-mapping in diverse populations enables novel insights into the biology of complex traits by pinpointing high-confidence causal variants for further characterization.
Although genetic risk scores have been used to predict hypertension, their utility in the clinical setting remains uncertain. Our study comprised N=218 792 FinnGen participants (mean age 58 years, 56% women) and N=22 624 well-phenotyped FINRISK participants (mean age 50 years, 53% women). We used public genome-wide association data to compute polygenic risk scores (PRSs) for systolic and diastolic blood pressure (BP). Using time-to-event analysis, we then assessed (1) the association of BP PRSs with hypertension and cardiovascular disease (CVD) in FinnGen and (2) the improvement in model discrimination when combining BP PRSs with the validated 4- and 10-year clinical risk scores for hypertension and CVD in FINRISK. In FinnGen, compared with having a 20 to 80 percentile range PRS, a PRS in the highest 2.5% conferred 2.3-fold (95% CI, 2.2–2.4) risk of hypertension and 10.6 years (95% CI, 9.9–11.4) earlier hypertension onset. In subgroup analyses, this risk was only 1.6-fold (95% CI, 1.5–1.7) for late-onset hypertension (age ≥55 years) but 2.8-fold (95% CI, 2.6–2.9) for early-onset hypertension (age <55 years). Elevated systolic BP PRS also conferred 1.3-fold (95% CI, 1.2–1.4) risk of CVD and 2.3 years (95% CI, 1.6–3.1) earlier onset. In FINRISK, systolic and diastolic BP PRSs improved clinical risk prediction of hypertension (but not CVD), increasing the C statistics by 0.7% (95% CI, 0.3–1.1). We demonstrate that genetic information improves hypertension risk prediction. BP PRSs together with traditional risk factors could improve prediction of hypertension and particularly early-onset hypertension, which confers substantial CVD risk.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.