Stratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for prediction of estrogen receptor (ER)-specific disease, from the largest available genome-wide association dataset and to empirically validate the PRSs in prospective studies. The development dataset comprised 94,075 case subjects and 75,017 control subjects of European ancestry from 69 studies, divided into training and validation sets. Samples were genotyped using genome-wide arrays, and single-nucleotide polymorphisms (SNPs) were selected by stepwise regression or lasso penalized regression. The best performing PRSs were validated in an independent test set comprising 11,428 case subjects and 18,323 control subjects from 10 prospective studies and 190,040 women from UK Biobank (3,215 incident breast cancers). For the best PRSs (313 SNPs), the odds ratio for overall disease per 1 standard deviation in ten prospective studies was 1.61 (95%CI: 1.57–1.65) with area under receiver-operator curve (AUC) = 0.630 (95%CI: 0.628–0.651). The lifetime risk of overall breast cancer in the top centile of the PRSs was 32.6%. Compared with women in the middle quintile, those in the highest 1% of risk had 4.37- and 2.78-fold risks, and those in the lowest 1% of risk had 0.16- and 0.27-fold risks, of developing ER-positive and ER-negative disease, respectively. Goodness-of-fit tests indicated that this PRS was well calibrated and predicts disease risk accurately in the tails of the distribution. This PRS is a powerful and reliable predictor of breast cancer risk that may improve breast cancer prevention programs.
BACKGROUNDGenetic testing for breast cancer susceptibility is widely used, but for many genes, evidence of an association with breast cancer is weak, underlying risk estimates are imprecise, and reliable subtype-specific risk estimates are lacking. METHODSWe used a panel of 34 putative susceptibility genes to perform sequencing on samples from 60,466 women with breast cancer and 53,461 controls. In separate analyses for protein-truncating variants and rare missense variants in these genes, we estimated odds ratios for breast cancer overall and tumor subtypes. We evaluated missense-variant associations according to domain and classification of pathogenicity. RESULTSProtein-truncating variants in 5 genes (ATM, BRCA1, BRCA2, CHEK2, and PALB2) were associated with a risk of breast cancer overall with a P value of less than 0.0001. Protein-truncating variants in 4 other genes (BARD1, RAD51C, RAD51D, and TP53) were associated with a risk of breast cancer overall with a P value of less than 0.05 and a Bayesian false-discovery probability of less than 0.05. For protein-truncating variants in 19 of the remaining 25 genes, the upper limit of the 95% confidence interval of the odds ratio for breast cancer overall was less than 2.0. For protein-truncating variants in ATM and CHEK2, odds ratios were higher for estrogen receptor (ER)-positive disease than for ER-negative disease; for protein-truncating variants in BARD1, BRCA1, BRCA2, PALB2, RAD51C, and RAD51D, odds ratios were higher for ER-negative disease than for ER-positive disease. Rare missense variants (in aggregate) in ATM, CHEK2, and TP53 were associated with a risk of breast cancer overall with a P value of less than 0.001. For BRCA1, BRCA2, and TP53, missense variants (in aggregate) that would be classified as pathogenic according to standard criteria were associated with a risk of breast cancer overall, with the risk being similar to that of protein-truncating variants. CONCLUSIONSThe results of this study define the genes that are most clinically useful for inclusion on panels for the prediction of breast cancer risk, as well as provide estimates of the risks associated with protein-truncating variants, to guide genetic counseling. (Funded by European Union Horizon 2020 programs and others.
SNP18 added substantial information to risk assessment based on the Tyrer-Cuzick model and mammographic density. A combined risk is likely to aid risk-stratified screening and prevention strategies.
Breast cancer susceptibility variants frequently show heterogeneity in associations by tumor subtype. To identify novel loci, we performed a genome-wide association study (GWAS) including 133,384 breast cancer cases and 113,789 controls, plus 18,908 BRCA1 mutation carriers (9,414 with breast cancer) of European ancestry, using both standard and novel methodologies that account for underlying tumor heterogeneity by estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2) status and tumor grade. We identified 32 novel susceptibility loci (P<5.0x10 -8 ), 15 of which showed evidence for associations with at least one tumor feature (false discovery rate <0.05). Five loci showed associations (P<0.05) in opposite directions between luminal-and non-luminal subtypes. In-silico analyses showed these five loci contained cell-specific enhancers that differed between normal luminal and basal mammary cells. The genetic correlations between five intrinsic-like subtypes ranged from 0.35 to 0.80. The proportion of genome-wide chip heritability explained by all known susceptibility loci was 37.6% for triple-negative and 54.2% for luminal A-like disease. These findings provide an improved understanding of genetic predisposition to breast cancer subtypes and will inform the development of subtype-specific polygenic risk scores.
Background We examined the associations between germline variants and breast cancer mortality using a large meta-analysis of women of European ancestry. Methods Meta-analyses included summary estimates based on Cox models of twelve datasets using ~10.4 million variants for 96,661 women with breast cancer and 7697 events (breast cancer-specific deaths). Oestrogen receptor (ER)-specific analyses were based on 64,171 ER-positive (4116) and 16,172 ER-negative (2125) patients. We evaluated the probability of a signal to be a true positive using the Bayesian false discovery probability (BFDP). Results We did not find any variant associated with breast cancer-specific mortality at P < 5 × 10 −8 . For ER-positive disease, the most significantly associated variant was chr7:rs4717568 (BFDP = 7%, P = 1.28 × 10 −7 , hazard ratio [HR] = 0.88, 95% confidence interval [CI] = 0.84–0.92); the closest gene is AUTS2 . For ER-negative disease, the most significant variant was chr7:rs67918676 (BFDP = 11%, P = 1.38 × 10 −7 , HR = 1.27, 95% CI = 1.16–1.39); located within a long intergenic non-coding RNA gene (AC004009.3), close to the HOXA gene cluster. Conclusions We uncovered germline variants on chromosome 7 at BFDP < 15% close to genes for which there is biological evidence related to breast cancer outcome. However, the paucity of variants associated with mortality at genome-wide significance underpins the challenge in providing genetic-based individualised prognostic information for breast cancer patients.
Pathogenic variants in BRCA1 or BRCA2 are identified in ∼20% of families with multiple individuals affected by early-onset breast and/or ovarian cancer. Extensive searches for additional highly penetrant genes or alternative mutational mechanisms altering BRCA1 or BRCA2 have not explained the missing heritability. Here, we report a dominantly inherited 5' UTR variant associated with epigenetic BRCA1 silencing due to promoter hypermethylation in two families affected by breast and ovarian cancer. BRCA1 promoter methylation of ten CpG dinucleotides in families who are affected by breast and/or ovarian cancer but do not have germline BRCA1 or BRCA2 pathogenic variants was assessed by pyrosequencing and clonal bisulfite sequencing. RNA and DNA sequencing of BRCA1 from lymphocytes was undertaken to establish allelic expression and the presence of germline variants. BRCA1 promoter hypermethylation was identified in 2 of 49 families in which multiple women are affected by grade 3 breast cancer or high-grade serous ovarian cancer. Soma-wide BRCA1 promoter hypermethylation was confirmed in blood, buccal mucosa, and hair follicles. Pyrosequencing showed that DNA was ∼50% methylated, consistent with the silencing of one allele, which was confirmed by clonal bisulfite sequencing. RNA sequencing revealed the allelic loss of BRCA1 expression in both families and that this loss of expression segregated with the heterozygous variant c.-107A>T in the BRCA1 5' UTR. Our results establish a mechanism whereby familial breast and ovarian cancer is caused by an in cis 5' UTR variant associated with epigenetic silencing of the BRCA1 promoter in two independent families. We propose that methylation analyses be undertaken to establish the frequency of this mechanism in families affected by early-onset breast and/or ovarian cancer without a BRCA1 or BRCA2 pathogenic variant.
Panels of single nucleotide polymorphisms (SNPs) stratify risk for breast cancer in women from the general population, but studies are needed assess their use in a fully comprehensive model including classical risk factors, mammographic density and more than 100 SNPs associated with breast cancer. A case–control study was designed (1,668 controls, 405 cases) in women aged 47–73 years attending routine screening in Manchester UK, and enrolled in a wider study to assess methods for risk assessment. Risk from classical questionnaire risk factors was assessed using the Tyrer–Cuzick model; mean percentage visual mammographic density was scored by two independent readers. DNA extracted from saliva was genotyped at selected SNPs using the OncoArray. A predefined polygenic risk score based on 143 SNPs was calculated (SNP143). The odds ratio (OR, and 95% confidence interval, CI) per interquartile range (IQ‐OR) of SNP143 was estimated unadjusted and adjusted for Tyrer–Cuzick and breast density. Secondary analysis assessed risk by oestrogen receptor (ER) status. The primary polygenic risk score was well calibrated (O/E OR 1.10, 95% CI 0.86–1.34) and accuracy was retained after adjustment for Tyrer–Cuzick risk and mammographic density (IQ‐OR unadjusted 2.12, 95% CI% 1.75–2.42; adjusted 2.06, 95% CI 1.75–2.42). SNP143 was a risk factor for ER+ and ER− breast cancer (adjusted IQ‐OR, ER+ 2.11, 95% CI 1.78–2.51; ER− 1.81, 95% CI 1.16–2.84). In conclusion, polygenic risk scores based on a large number of SNPs improve risk stratification in combination with classical risk factors and mammographic density, and SNP143 was similarly predictive for ER‐positive and ER‐negative disease.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.