Enhancer sequences control gene expression and comprise binding sites (motifs) for different transcription factors (TFs). Despite extensive genetic and computational studies, the relationship between DNA sequence and regulatory activity is poorly understood and enhancer de novo design is considered impossible. Here we built a deep learning model, DeepSTARR, to quantitatively predict the activities of thousands of developmental and housekeeping enhancers directly from DNA sequence in Drosophila melanogaster S2 cells. The model learned relevant TF motifs and higher-order syntax rules, including functionally non-equivalent instances of the same TF motif that are determined by motif-flanking sequence and inter-motif distances. We validated these rules experimentally and demonstrated their conservation in human by testing more than 40,000 wildtype and mutant Drosophila and human enhancers. Finally, we designed and functionally validated synthetic enhancers with desired activities de novo..
Centrosomes are the major microtubule organising centres of animal cells. Deregulation in their number occurs in cancer and was shown to trigger tumorigenesis in mice. However, the incidence, consequence and origins of this abnormality are poorly understood. Here, we screened the NCI-60 panel of human cancer cell lines to systematically analyse centriole number and structure. Our screen shows that centriole amplification is widespread in cancer cell lines and highly prevalent in aggressive breast carcinomas. Moreover, we identify another recurrent feature of cancer cells: centriole size deregulation. Further experiments demonstrate that severe centriole over-elongation can promote amplification through both centriole fragmentation and ectopic procentriole formation. Furthermore, we show that overly long centrioles form over-active centrosomes that nucleate more microtubules, a known cause of invasiveness, and perturb chromosome segregation. Our screen establishes centriole amplification and size deregulation as recurrent features of cancer cells and identifies novel causes and consequences of those abnormalities.
Mutations causing aberrant splicing are frequently implicated in human diseases including cancer. Here, we establish a high-throughput screen of randomly mutated minigenes to decode the cis-regulatory landscape that determines alternative splicing of exon 11 in the proto-oncogene MST1R (RON). Mathematical modelling of splicing kinetics enables us to identify more than 1000 mutations affecting RON exon 11 skipping, which corresponds to the pathological isoform RON∆165. Importantly, the effects correlate with RON alternative splicing in cancer patients bearing the same mutations. Moreover, we highlight heterogeneous nuclear ribonucleoprotein H (HNRNPH) as a key regulator of RON splicing in healthy tissues and cancer. Using iCLIP and synergy analysis, we pinpoint the functionally most relevant HNRNPH binding sites and demonstrate how cooperative HNRNPH binding facilitates a splicing switch of RON exon 11. Our results thereby offer insights into splicing regulation and the impact of mutations on alternative splicing in cancer.
BackgroundBreast cancer is a highly heterogeneous disease resulting in diverse clinical behaviours and therapeutic responses. DNA methylation is a major epigenetic alteration that is commonly perturbed in cancers. The aim of this study is to characterize the relationship between DNA methylation and aberrant gene expression in breast cancer.MethodsWe analysed DNA methylation and gene expression profiles from breast cancer tissue and matched normal tissue in The Cancer Genome Atlas (TCGA). Genome-wide differential methylation analysis and methylation-gene expression correlation was performed. Gene expression changes were subsequently validated in the METABRIC dataset. The Oncoscore tool was used to identify genes that had previously been associated with cancer in the literature. A subset of genes that had not previously been studied in cancer was chosen for further analysis.ResultsWe identified 368 CpGs that were differentially methylated between tumor and normal breast tissue (∆β > 0.4). Hypermethylated CpGs were overrepresented in tumor tissue and were found predominantly (56%) in upstream promoter regions. Conversely, hypomethylated CpG sites were found primarily in the gene body (66%). Expression analysis revealed that 209 of the differentially-methylated CpGs were located in 169 genes that were differently expressed between normal and breast tumor tissue. Methylation-expression correlations were predominantly negative (70%) for promoter CpG sites and positive (74%) for gene body CpG sites. Among these differentially-methylated and differentially-expressed genes, we identified 7 that had not previously been studied in any form of cancer. Three of these, TDRD10, PRAC2 and TMEM132C, contained CpG sites that showed diagnostic and prognostic value in breast cancer, particularly in estrogen-receptor (ER)-positive samples. A pan-cancer analysis confirmed differential expression of these genes together with diagnostic and prognostic value of their respective CpG sites in multiple cancer types.ConclusionWe have identified 368 DNA methylation changes that characterize breast cancer tumor tissue, of which 209 are associated with genes that are differentially-expressed in the same samples. Novel DNA methylation markers were identified, of which cg12374721 (PRAC2), cg18081940 (TDRD10) and cg04475027 (TMEM132C) show promise as diagnostic and prognostic markers in breast cancer as well as other cancer types.Electronic supplementary materialThe online version of this article (10.1186/s12885-019-5403-0) contains supplementary material, which is available to authorized users.
Centrosome amplification (CA) is a common feature of human tumours and a promising target for cancer therapy. However, CA’s pan-cancer prevalence, molecular role in tumourigenesis and therapeutic value in the clinical setting are still largely unexplored. Here, we used a transcriptomic signature (CA20) to characterise the landscape of CA-associated gene expression in 9,721 tumours from The Cancer Genome Atlas (TCGA). CA20 is upregulated in cancer and associated with distinct clinical and molecular features of breast cancer, consistently with our experimental CA quantification in patient samples. Moreover, we show that CA20 upregulation is positively associated with genomic instability, alteration of specific chromosomal arms and C>T mutations, and we propose novel molecular players associated with CA in cancer. Finally, high CA20 is associated with poor prognosis and, by integrating drug sensitivity with drug perturbation profiles in cell lines, we identify candidate compounds for selectively targeting cancer cells exhibiting transcriptomic evidence for CA.
The use of computational tools to identify biological targets of natural products with anticancer properties and unknown modes of action is gaining momentum. We employed self-organizing maps to deconvolute the phenotypic effects of piperlongumine (PL) and establish a link to modulation of the human transient receptor potential vanilloid 2 (hTRPV2) channel. The structure of the PL-bound full-length rat TRPV2 channel was determined by cryo-EM. PL binds to a transient allosteric pocket responsible for a new mode of anticancer activity against glioblastoma (GBM) in which hTRPV2 is overexpressed. Calcium imaging experiments revealed the importance of Arg539 and Thr522 residues on the antagonistic effect of PL and calcium influx modulation of the TRPV2 channel. Downregulation of hTRPV2 reduces sensitivity to PL and decreases ROS production. Analysis of GBM patient samples associates hTRPV2 overexpression with tumor grade, disease progression, and poor prognosis. Extensive tumor abrogation and long term survival was achieved in two murine models of orthotopic GBM by formulating PL in an implantable scaffold/hydrogel for sustained local therapy. Furthermore, in primary tumor samples derived from GBM patients, we observed a selective reduction of malignant cells in response to PL ex vivo . Our results establish a broadly applicable strategy, leveraging data-motivated research hypotheses for the discovery of novel means tackling cancer.
The role of RANKL-RANK pathway in progesterone-driven mammary carcinogenesis and triple negative breast cancer tumorigenesis has been well characterized. However, and despite evidences of the existence of RANK-positive hormone receptor (HR)positive breast tumors, the implication of RANK expression in HR-positive breast cancers has not been addressed before. Here, we report that RANK pathway affects the expression of cell cycle regulators and decreases sensitivity to fulvestrant of estrogen receptor (ER)-positive (ER+)/HER2-breast cancer cells, MCF-7 and T47D. Moreover, RANK overexpressing cells had a staminal and mesenchymal phenotype, with decreased proliferation rate and decreased susceptibility to chemotherapy, but were more invasive in vivo. In silico analysis of the transcriptome of human breast tumors, confirmed the association between RANK expression and stem cell and mesenchymal markers in ER+HER2-tumors. Importantly, exposure of ER+HER2cells to continuous RANK pathway activation by exogenous RANKL, in vitro and in vivo, induced a negative feedback effect, independent of RANK levels, leading to the downregulation of HR and increased resistance to hormone therapy. These results suggest that ER+HER2-RANK-positive cells may constitute an important reservoir of slow cycling, therapy-resistance cancer cells; and that RANK pathway activation is deleterious in all ER+HER2-breast cancer cells, independently of RANK levels.
Enhancer sequences control gene expression and comprise binding sites (motifs) for different transcription factors (TFs). Despite extensive genetic and computational studies, the relationship between DNA sequence and regulatory activity is poorly understood and enhancer de novo design is considered impossible. Here we built a deep learning model, DeepSTARR, to quantitatively predict the activities of thousands of developmental and housekeeping enhancers directly from DNA sequence in Drosophila melanogaster S2 cells. The model learned relevant TF motifs and higher-order syntax rules, including functionally non-equivalent instances of the same TF motif that are determined by motif-flanking sequence and inter-motif distances. We validated these rules experimentally and demonstrated their conservation in human by testing more than 40,000 wildtype and mutant Drosophila and human enhancers. Finally, we designed and functionally validated synthetic enhancers with desired activities de novo.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.