Stratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for prediction of estrogen receptor (ER)-specific disease, from the largest available genome-wide association dataset and to empirically validate the PRSs in prospective studies. The development dataset comprised 94,075 case subjects and 75,017 control subjects of European ancestry from 69 studies, divided into training and validation sets. Samples were genotyped using genome-wide arrays, and single-nucleotide polymorphisms (SNPs) were selected by stepwise regression or lasso penalized regression. The best performing PRSs were validated in an independent test set comprising 11,428 case subjects and 18,323 control subjects from 10 prospective studies and 190,040 women from UK Biobank (3,215 incident breast cancers). For the best PRSs (313 SNPs), the odds ratio for overall disease per 1 standard deviation in ten prospective studies was 1.61 (95%CI: 1.57–1.65) with area under receiver-operator curve (AUC) = 0.630 (95%CI: 0.628–0.651). The lifetime risk of overall breast cancer in the top centile of the PRSs was 32.6%. Compared with women in the middle quintile, those in the highest 1% of risk had 4.37- and 2.78-fold risks, and those in the lowest 1% of risk had 0.16- and 0.27-fold risks, of developing ER-positive and ER-negative disease, respectively. Goodness-of-fit tests indicated that this PRS was well calibrated and predicts disease risk accurately in the tails of the distribution. This PRS is a powerful and reliable predictor of breast cancer risk that may improve breast cancer prevention programs.
We evaluated whether a 76-locus polygenic risk score (PRS) and Breast Imaging Reporting and Data System (BI-RADS) breast density were independent risk factors within three studies (1643 case patients, 2397 control patients) using logistic regression models. We incorporated the PRS odds ratio (OR) into the Breast Cancer Surveillance Consortium (BCSC) risk-prediction model while accounting for its attributable risk and compared five-year absolute risk predictions between models using area under the curve (AUC) statistics. All statistical tests were two-sided. BI-RADS density and PRS were independent risk factors across all three studies (P interaction = .23). Relative to those with scattered fibroglandular densities and average PRS (2(nd) quartile), women with extreme density and highest quartile PRS had 2.7-fold (95% confidence interval [CI] = 1.74 to 4.12) increased risk, while those with low density and PRS had reduced risk (OR = 0.30, 95% CI = 0.18 to 0.51). PRS added independent information (P < .001) to the BCSC model and improved discriminatory accuracy from AUC = 0.66 to AUC = 0.69. Although the BCSC-PRS model was well calibrated in case-control data, independent cohort data are needed to test calibration in the general population.
Triple-negative (TN) breast cancer is an aggressive subtype of breast cancer associated with a unique set of epidemiologic and genetic risk factors. We conducted a two-stage genome-wide association study of TN breast cancer (stage 1: 1529 TN cases, 3399 controls; stage 2: 2148 cases, 1309 controls) to identify loci that influence TN breast cancer risk. Variants in the 19p13.1 and PTHLH loci showed genome-wide significant associations (P < 5 × 10(-) (8)) in stage 1 and 2 combined. Results also suggested a substantial enrichment of significantly associated variants among the single nucleotide polymorphisms (SNPs) analyzed in stage 2. Variants from 25 of 74 known breast cancer susceptibility loci were also associated with risk of TN breast cancer (P < 0.05). Associations with TN breast cancer were confirmed for 10 loci (LGR6, MDM4, CASP8, 2q35, 2p24.1, TERT-rs10069690, ESR1, TOX3, 19p13.1, RALY), and we identified associations with TN breast cancer for 15 additional breast cancer loci (P < 0.05: PEX14, 2q24.1, 2q31.1, ADAM29, EBF1, TCF7L2, 11q13.1, 11q24.3, 12p13.1, PTHLH, NTN4, 12q24, BRCA2, RAD51L1-rs2588809, MKL1). Further, two SNPs independent of previously reported signals in ESR1 [rs12525163 odds ratio (OR) = 1.15, P = 4.9 × 10(-) (4)] and 19p13.1 (rs1864112 OR = 0.84, P = 1.8 × 10(-) (9)) were associated with TN breast cancer. A polygenic risk score (PRS) for TN breast cancer based on known breast cancer risk variants showed a 4-fold difference in risk between the highest and lowest PRS quintiles (OR = 4.03, 95% confidence interval 3.46-4.70, P = 4.8 × 10(-) (69)). This translates to an absolute risk for TN breast cancer ranging from 0.8% to 3.4%, suggesting that genetic variation may be used for TN breast cancer risk prediction.
The 10q26 locus in the second intron of FGFR2 is the locus most strongly associated with estrogen-receptor-positive breast cancer in genome-wide association studies. We conducted fine-scale mapping in case-control studies genotyped with a custom chip (iCOGS), comprising 41 studies (n = 89,050) of European ancestry, 9 Asian ancestry studies (n = 13,983), and 2 African ancestry studies (n = 2,028) from the Breast Cancer Association Consortium. We identified three statistically independent risk signals within the locus. Within risk signals 1 and 3, genetic analysis identified five and two variants, respectively, highly correlated with the most strongly associated SNPs. By using a combination of genetic fine mapping, data on DNase hypersensitivity, and electrophoretic mobility shift assays to study protein-DNA binding, we identified rs35054928, rs2981578, and rs45631563 as putative functional SNPs. Chromatin immunoprecipitation showed that FOXA1 preferentially bound to the risk-associated allele (C) of rs2981578 and was able to recruit ERα to this site in an allele-specific manner, whereas E2F1 preferentially bound the risk variant of rs35054928. The risk alleles were preferentially found in open chromatin and bound by Ser5 phosphorylated RNA polymerase II, suggesting that the risk alleles are associated with changes in transcription. Chromatin conformation capture demonstrated that the risk region was able to interact with the promoter of FGFR2, the likely target gene of this risk region. A role for FOXA1 in mediating breast cancer susceptibility at this locus is consistent with the finding that the FGFR2 risk locus primarily predisposes to estrogen-receptor-positive disease.
Previous studies have suggested that polymorphisms in CASP8 on chromosome 2 are associated with breast cancer risk. To clarify the role of CASP8 in breast cancer susceptibility, we carried out dense genotyping of this region in the Breast Cancer Association Consortium (BCAC). Single-nucleotide polymorphisms (SNPs) spanning a 1 Mb region around CASP8 were genotyped in 46 450 breast cancer cases and 42 600 controls of European origin from 41 studies participating in the BCAC as part of a custom genotyping array experiment (iCOGS). Missing genotypes and SNPs were imputed and, after quality exclusions, 501 typed and 1232 imputed SNPs were included in logistic regression models adjusting for study and ancestry principal components. The SNPs retained in the final model were investigated further in data from nine genome-wide association studies (GWAS) comprising in total 10 052 case and 12 575 control subjects. The most significant association signal observed in European subjects was for the imputed intronic SNP rs1830298 in ALS2CR12 (telomeric to CASP8), with per allele odds ratio and 95% confidence interval [OR (95% confidence interval, CI)] for the minor allele of 1.05 (1.03-1.07), P = 1 × 10(-5). Three additional independent signals from intronic SNPs were identified, in CASP8 (rs36043647), ALS2CR11 (rs59278883) and CFLAR (rs7558475). The association with rs1830298 was replicated in the imputed results from the combined GWAS (P = 3 × 10(-6)), yielding a combined OR (95% CI) of 1.06 (1.04-1.08), P = 1 × 10(-9). Analyses of gene expression associations in peripheral blood and normal breast tissue indicate that CASP8 might be the target gene, suggesting a mechanism involving apoptosis.
Obesity is a risk factor for triple‐negative breast cancer ( TNBC ) incidence and poor outcomes, but the underlying molecular biology remains unknown. We previously identified in TNBC cell cultures that expression of epigenetic reader methyl‐CpG‐binding domain protein 2 ( MBD 2), specifically the alternative mRNA splicing variant MBD variant 2 ( MBD 2_v2), is dependent on reactive oxygen species ( ROS ) and is crucial for maintenance and expansion of cancer stem cell‐like cells ( CSC s). Because obesity is coupled with inflammation and ROS , we hypothesized that obesity can fuel an increase in MBD 2_v2 expression to promote the tumor‐initiating CSC phenotype in TNBC cells in vivo . Analysis of TNBC patient datasets revealed associations between high tumor MBD 2_v2 expression and high relapse rates and high body mass index ( BMI ). Stable gene knockdown/overexpression methods were applied to TNBC cell lines to elucidate that MBD 2_v2 expression is governed by ROS ‐dependent expression of serine‐ and arginine‐rich splicing factor 2 ( SRSF 2). We employed a diet‐induced obesity ( DIO ) mouse model that mimics human obesity to investigate whether obesity causes increased MBD 2_v2 expression and increased tumor initiation capacity in inoculated TNBC cell lines. MBD 2_v2 and SRSF 2 levels were increased in TNBC cell line‐derived tumors that formed more frequently in DIO mice relative to tumors in lean control mice. Stable MBD 2_v2 overexpression increased the CSC fraction in culture and increased TNBC cell line tumor initiation capacity in vivo . SRSF 2 knockdown resulted in decreased MBD 2_v2 expression, decreased CSC s in TNBC cell cultures, and hindered tumor formation in vivo . This report describes evidence to support the conclusion that MBD 2_v2 expression is induced by obesity and drives TNBC cell tumorigenicity, and thus provides molecular insights into support of the epidemiological evidence that obesity is a risk factor for TNBC . The majority of TNBC patients are obese and rising obesity rates threaten to further increase the burden of obesity‐linked cancers, which reinforces the relevance of this report.
Mammographic density measures adjusted for age and body mass index (BMI) are heritable predictors of breast cancer risk but few mammographic density-associated genetic variants have been identified. Using data for 10,727 women from two international consortia, we estimated associations between 77 common breast cancer susceptibility variants and absolute dense area, percent dense area and absolute non-dense area adjusted for study, age and BMI using mixed linear modeling. We found strong support for established associations between rs10995190 (in the region of ZNF365), rs2046210 (ESR1) and rs3817198 (LSP1) and adjusted absolute and percent dense areas (all p <10−5). Of 41 recently discovered breast cancer susceptibility variants, associations were found between rs1432679 (EBF1), rs17817449 (MIR1972-2: FTO), rs12710696 (2p24.1), and rs3757318 (ESR1) and adjusted absolute and percent dense areas, respectively. There were associations between rs6001930 (MKL1) and both adjusted absolute dense and non-dense areas, and between rs17356907 (NTN4) and adjusted absolute non-dense area. Trends in all but two associations were consistent with those for breast cancer risk. Results suggested that 18% of breast cancer susceptibility variants were associated with at least one mammographic density measure. Genetic variants at multiple loci were associated with both breast cancer risk and the mammographic density measures. Further understanding of the underlying mechanisms at these loci could help identify etiological pathways implicated in how mammographic density predicts breast cancer risk.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.