We developed a variant-annotation method that combines sequence-based machine-learning classification with a context-dependent algorithm for selecting splice variants. Our approach is distinctive in that it compares the splice potential of a sequence bearing a variant with the splice potential of the reference sequence. After training, classification accurately identified 168 of 180 (93.3%) canonical splice sites of five genes. The combined method, CryptSplice, identified and correctly predicted the effect of 18 of 21 (86%) known splice-altering variants in CFTR, a well-studied gene whose loss-of-function variants cause cystic fibrosis (CF). Among 1,423 unannotated CFTR disease-associated variants, the method identified 32 potential exonic cryptic splice variants, two of which were experimentally evaluated and confirmed. After complete CFTR sequencing, the method found three cryptic intronic splice variants (one known and two experimentally verified) that completed the molecular diagnosis of CF in 6 of 14 individuals. CryptSplice interrogation of sequence data from six individuals with X-linked dyskeratosis congenita caused by an unknown disease-causing variant in DKC1 identified two splice-altering variants that were experimentally verified. To assess the extent to which disease-associated variants might activate cryptic splicing, we selected 458 pathogenic variants and 348 variants of uncertain significance (VUSs) classified as high confidence from ClinVar. Splice-site activation was predicted for 129 (28%) of the pathogenic variants and 75 (22%) of the VUSs. Our findings suggest that cryptic splice-site activation is more common than previously thought and should be routinely considered for all variants within the transcribed regions of genes.
Assessment of the functional consequences of variants near splice sites is a major challenge in the diagnostic laboratory. To address this issue, we created expression minigenes (EMGs) to determine the RNA and protein products generated by splice site variants (n = 10) implicated in cystic fibrosis (CF). Experimental results were compared with the splicing predictions of eight in silico tools. EMGs containing the full-length Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) coding sequence and flanking intron sequences generated wild-type transcript and fully processed protein in Human Embryonic Kidney (HEK293) and CF bronchial epithelial (CFBE41o-) cells. Quantification of variant induced aberrant mRNA isoforms was concordant using fragment analysis and pyrosequencing. The splicing patterns of c.1585−1G>A and c.2657+5G>A were comparable to those reported in primary cells from individuals bearing these variants. Bioinformatics predictions were consistent with experimental results for 9/10 variants (MES), 8/10 variants (NNSplice), and 7/10 variants (SSAT and Sroogle). Programs that estimate the consequences of mis-splicing predicted 11/16 (HSF and ASSEDA) and 10/16 (Fsplice and SplicePort) experimentally observed mRNA isoforms. EMGs provide a robust experimental approach for clinical interpretation of splice site variants and refinement of in silico tools.
BackgroundRecent epidemiological studies demonstrate that both active and involuntary exposure to tobacco smoke increase the risk of breast cancer. Little is known, however, about the molecular mechanisms by which continuous, long term exposure to tobacco smoke contributes to breast carcinogenesis because most previous studies have focused on short term treatment models. In this work we have set out to investigate the progressive transforming effects of tobacco smoke on non-tumorigenic mammary epithelial cells and breast cancer cells using in vitro and in vivo models of chronic cigarette smoke exposure.ResultsWe show that both non-tumorigenic (MCF 10A, MCF-12A) and tumorigenic (MCF7) breast epithelial cells exposed to cigarette smoke acquire mesenchymal properties such as fibroblastoid morphology, increased anchorage-independent growth, and increased motility and invasiveness. Moreover, transplantation experiments in mice demonstrate that treatment with cigarette smoke extract renders MCF 10A cells more capable to survive and colonize the mammary ducts and MCF7 cells more prone to metastasize from a subcutaneous injection site, independent of cigarette smoke effects on the host and stromal environment. The extent of transformation and the resulting phenotype thus appear to be associated with the differentiation state of the cells at the time of exposure. Analysis by flow cytometry showed that treatment with CSE leads to the emergence of a CD44hi/CD24low population in MCF 10A cells and of CD44+ and CD49f + MCF7 cells, indicating that cigarette smoke causes the emergence of cell populations bearing markers of self-renewing stem-like cells. The phenotypical alterations induced by cigarette smoke are accompanied by numerous changes in gene expression that are associated with epithelial to mesenchymal transition and tumorigenesis.ConclusionsOur results indicate that exposure to cigarette smoke leads to a more aggressive and transformed phenotype in human mammary epithelial cells and that the differentiation state of the cell at the time of exposure may be an important determinant in the phenotype of the final transformed state.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.