Polygenic risk scores have shown great promise in predicting complex disease risk and will become more accurate as training sample sizes increase. The standard approach for calculating risk scores involves linkage disequilibrium (LD)-based marker pruning and applying a p value threshold to association statistics, but this discards information and can reduce predictive accuracy. We introduce LDpred, a method that infers the posterior mean effect size of each marker by using a prior on effect sizes and LD information from an external reference panel. Theory and simulations show that LDpred outperforms the approach of pruning followed by thresholding, particularly at large sample sizes. Accordingly, predicted R(2) increased from 20.1% to 25.3% in a large schizophrenia dataset and from 9.8% to 12.0% in a large multiple sclerosis dataset. A similar relative improvement in accuracy was observed for three additional large disease datasets and for non-European schizophrenia samples. The advantage of LDpred over existing methods will grow as sample sizes increase.
SummaryBackgroundLow-risk limits recommended for alcohol consumption vary substantially across different national guidelines. To define thresholds associated with lowest risk for all-cause mortality and cardiovascular disease, we studied individual-participant data from 599 912 current drinkers without previous cardiovascular disease.MethodsWe did a combined analysis of individual-participant data from three large-scale data sources in 19 high-income countries (the Emerging Risk Factors Collaboration, EPIC-CVD, and the UK Biobank). We characterised dose–response associations and calculated hazard ratios (HRs) per 100 g per week of alcohol (12·5 units per week) across 83 prospective studies, adjusting at least for study or centre, age, sex, smoking, and diabetes. To be eligible for the analysis, participants had to have information recorded about their alcohol consumption amount and status (ie, non-drinker vs current drinker), plus age, sex, history of diabetes and smoking status, at least 1 year of follow-up after baseline, and no baseline history of cardiovascular disease. The main analyses focused on current drinkers, whose baseline alcohol consumption was categorised into eight predefined groups according to the amount in grams consumed per week. We assessed alcohol consumption in relation to all-cause mortality, total cardiovascular disease, and several cardiovascular disease subtypes. We corrected HRs for estimated long-term variability in alcohol consumption using 152 640 serial alcohol assessments obtained some years apart (median interval 5·6 years [5th–95th percentile 1·04–13·5]) from 71 011 participants from 37 studies.FindingsIn the 599 912 current drinkers included in the analysis, we recorded 40 310 deaths and 39 018 incident cardiovascular disease events during 5·4 million person-years of follow-up. For all-cause mortality, we recorded a positive and curvilinear association with the level of alcohol consumption, with the minimum mortality risk around or below 100 g per week. Alcohol consumption was roughly linearly associated with a higher risk of stroke (HR per 100 g per week higher consumption 1·14, 95% CI, 1·10–1·17), coronary disease excluding myocardial infarction (1·06, 1·00–1·11), heart failure (1·09, 1·03–1·15), fatal hypertensive disease (1·24, 1·15–1·33); and fatal aortic aneurysm (1·15, 1·03–1·28). By contrast, increased alcohol consumption was log-linearly associated with a lower risk of myocardial infarction (HR 0·94, 0·91–0·97). In comparison to those who reported drinking >0–≤100 g per week, those who reported drinking >100–≤200 g per week, >200–≤350 g per week, or >350 g per week had lower life expectancy at age 40 years of approximately 6 months, 1–2 years, or 4–5 years, respectively.InterpretationIn current drinkers of alcohol in high-income countries, the threshold for lowest risk of all-cause mortality was about 100 g/week. For cardiovascular disease subtypes other than myocardial infarction, there were no clear risk thresholds below which lower alcohol consumption stopped being ...
Genome-wide association studies (GWAS) and fine-mapping efforts to date have identified more than 100 prostate cancer (PrCa)-susceptibility loci. We meta-analyzed genotype data from a custom high-density array of 46,939 PrCa cases and 27,910 controls of European ancestry with previously genotyped data of 32,255 PrCa cases and 33,202 controls of European ancestry. Our analysis identified 62 novel loci associated (P < 5.0 × 10) with PrCa and one locus significantly associated with early-onset PrCa (≤55 years). Our findings include missense variants rs1800057 (odds ratio (OR) = 1.16; P = 8.2 × 10; G>C, p.Pro1054Arg) in ATM and rs2066827 (OR = 1.06; P = 2.3 × 10; T>G, p.Val109Gly) in CDKN1B. The combination of all loci captured 28.4% of the PrCa familial relative risk, and a polygenic risk score conferred an elevated PrCa risk for men in the ninetieth to ninety-ninth percentiles (relative risk = 2.69; 95% confidence interval (CI): 2.55-2.82) and first percentile (relative risk = 5.71; 95% CI: 5.04-6.48) risk stratum compared with the population average. These findings improve risk prediction, enhance fine-mapping, and provide insight into the underlying biology of PrCa.
The incidence of acute myeloid leukaemia (AML) increases with age and mortality exceeds 90% when diagnosed after age 65. Most cases arise without any detectable early symptoms and patients usually present with the acute complications of bone marrow failure. The onset of such de novo AML cases is typically preceded by the accumulation of somatic mutations in preleukaemic haematopoietic stem and progenitor cells (HSPCs) that undergo clonal expansion. However, recurrent AML mutations also accumulate in HSPCs during ageing of healthy individuals who do not develop AML, a phenomenon referred to as age-related clonal haematopoiesis (ARCH). Here we use deep sequencing to analyse genes that are recurrently mutated in AML to distinguish between individuals who have a high risk of developing AML and those with benign ARCH. We analysed peripheral blood cells from 95 individuals that were obtained on average 6.3 years before AML diagnosis (pre-AML group), together with 414 unselected age- and gender-matched individuals (control group). Pre-AML cases were distinct from controls and had more mutations per sample, higher variant allele frequencies, indicating greater clonal expansion, and showed enrichment of mutations in specific genes. Genetic parameters were used to derive a model that accurately predicted AML-free survival; this model was validated in an independent cohort of 29 pre-AML cases and 262 controls. Because AML is rare, we also developed an AML predictive model using a large electronic health record database that identified individuals at greater risk. Collectively our findings provide proof-of-concept that it is possible to discriminate ARCH from pre-AML many years before malignant transformation. This could in future enable earlier detection and monitoring, and may help to inform intervention.
Prostate cancer is the most frequently diagnosed cancer in males in developed countries. To identify common prostate cancer susceptibility alleles, we genotyped 211,155 SNPs on a custom Illumina array (iCOGS) in blood DNA from 25,074 prostate cancer cases and 24,272 controls from the international PRACTICAL Consortium. Twenty-three new prostate cancer susceptibility loci were identified at genome-wide significance (P < 5 × 10−8). More than 70 prostate cancer susceptibility loci, explaining ~30% of the familial risk for this disease, have now been identified. On the basis of combined risks conferred by the new and previously known risk loci, the top 1% of the risk distribution has a 4.7-fold higher risk than the average of the population being profiled. These results will facilitate population risk stratification for clinical studies.
An understanding of the etiologic heterogeneity of ovarian cancer is important for improving prevention, early detection, and therapeutic approaches. We evaluated 14 hormonal, reproductive, and lifestyle factors by histologic subtype in the Ovarian Cancer Cohort Consortium (OC3).Patients and Methods Among 1.3 million women from 21 studies, 5,584 invasive epithelial ovarian cancers were identified (3,378 serous, 606 endometrioid, 331 mucinous, 269 clear cell, 1,000 other). By using competingrisks Cox proportional hazards regression stratified by study and birth year and adjusted for age, parity, and oral contraceptive use, we assessed associations for all invasive cancers by histology. Heterogeneity was evaluated by likelihood ratio test. ResultsMost risk factors exhibited significant heterogeneity by histology. Higher parity was most strongly associated with endometrioid (relative risk [RR] per birth, 0.78; 95% CI, 0.74 to 0.83) and clear cell (RR, 0.68; 95% CI, 0.61 to 0.76) carcinomas (P value for heterogeneity [P-het] , .001). Similarly, age at menopause, endometriosis, and tubal ligation were only associated with endometrioid and clear cell tumors (P-het # .01). Family history of breast cancer (P-het = .008) had modest heterogeneity. Smoking was associated with an increased risk of mucinous (RR per 20 pack-years, 1.26; 95% CI, 1.08 to 1.46) but a decreased risk of clear cell (RR, 0.72; 95% CI, 0.55 to 0.94) tumors (P-het = .004). Unsupervised clustering by risk factors separated endometrioid, clear cell, and low-grade serous carcinomas from high-grade serous and mucinous carcinomas. ConclusionThe heterogeneous associations of risk factors with ovarian cancer subtypes emphasize the importance of conducting etiologic studies by ovarian cancer subtypes. Most established risk factors were more strongly associated with nonserous carcinomas, which demonstrate challenges for risk prediction of serous cancers, the most fatal subtype.
To identify common alleles associated with different histotypes of epithelial ovarian cancer (EOC), we pooled data from multiple genome-wide genotyping projects totaling 25,509 EOC cases and 40,941 controls. We identified nine new susceptibility loci for different EOC histotypes: six for serous EOC histotypes (3q28, 4q32.3, 8q21.11, 10q24.33, 18q11.2 and 22q12.1), two for mucinous EOC (3q22.3, 9q31.1) and one for endometrioid EOC (5q12.3). We then meta-analysed the results for high-grade serous ovarian cancer with the results from analysis of 31,448 BRCA1 and BRCA2 mutation carriers, including 3,887 mutation carriers with EOC. This identified an additional three loci at 2q13, 8q24.1 and 12q24.31. Integrated analyses of genes and regulatory biofeatures at each locus predicted candidate susceptibility genes, including OBFC1, a novel susceptibility gene for low grade/borderline serous EOC.
Objective: To assess the epidemiological evidence on diet and cancer and make public health recommendations. Design: Review of published studies, concentrating on recent systematic reviews, meta-analyses and large prospective studies. Conclusions and recommendations: Overweight/obesity increases the risk for cancers of the oesophagus (adenocarcinoma), colorectum, breast (postmenopausal), endometrium and kidney; body weight should be maintained in the body mass index range of 18.5-25 kg/m 2 , and weight gain in adulthood avoided. Alcohol causes cancers of the oral cavity, pharynx, oesophagus and liver, and a small increase in the risk for breast cancer; if consumed, alcohol intake should not exceed 2 units/d. Aflatoxin in foods causes liver cancer, although its importance in the absence of hepatitis virus infections is not clear; exposure to aflatoxin in foods should be minimised. Chinese-style salted fish increases the risk for nasopharyngeal cancer, particularly if eaten during childhood, and should be eaten only in moderation. Fruits and vegetables probably reduce the risk for cancers of the oral cavity, oesophagus, stomach and colorectum, and diets should include at least 400 g/d of total fruits and vegetables. Preserved meat and red meat probably increase the risk for colorectal cancer; if eaten, consumption of these foods should be moderate. Salt preserved foods and high salt intake probably increase the risk for stomach cancer; overall consumption of salt preserved foods and salt should be moderate. Very hot drinks and foods probably increase the risk for cancers of the oral cavity, pharynx and oesophagus; drinks and foods should not be consumed when they are scalding hot. Physical activity, the main determinant of energy expenditure, reduces the risk for colorectal cancer and probably reduces the risk for breast cancer; regular physical activity should be taken.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.