The introduction of insertion-deletions (INDELs) by non-homologous end-joining (NHEJ) pathway underlies the mechanistic basis of CRISPR-Cas9-directed genome editing. Selective gene ablation using CRISPR-Cas9 is achieved by installation of a premature termination codon (PTC) from a frameshift-inducing INDEL that elicits nonsense-mediated decay (NMD) of the mutant mRNA. Here, by examining the mRNA and protein products of CRISPR targeted genes in a cell line panel with presumed gene knockouts, we detect the production of foreign mRNAs or proteins in ~50% of the cell lines. We demonstrate that these aberrant protein products stem from the introduction of INDELs that promote internal ribosomal entry, convert pseudo-mRNAs (alternatively spliced mRNAs with a PTC) into protein encoding molecules, or induce exon skipping by disruption of exon splicing enhancers (ESEs). Our results reveal challenges to manipulating gene expression outcomes using INDEL-based mutagenesis and strategies useful in mitigating their impact on intended genome-editing outcomes.
The pathogenesis of ulcerative colitis (UC), a major type of inflammatory bowel disease, remains unknown. No model exists that adequately recapitulates the complexity of clinical UC. Here, we take advantage of induced pluripotent stem cells (iPSCs) to develop an induced human UC-derived organoid (iHUCO) model and compared it with the induced human normal organoid model (iHNO). Notably, iHUCOs recapitulated histological and functional features of primary colitic tissues, including the absence of acidic mucus secretion and aberrant adherens junctions in the epithelial barrier both in vitro and in vivo. We demonstrate that the CXCL8/CXCR1 axis was overexpressed in iHUCO but not in iHNO. As proof-of-principle, we show that inhibition of CXCL8 receptor by the small-molecule non-competitive inhibitor repertaxin attenuated the progression of UC phenotypes in vitro and in vivo. This patient-derived organoid model, containing both epithelial and stromal compartments, will generate new insights into the underlying pathogenesis of UC while offering opportunities to tailor interventions to the individual patient.
Identifying alternative indications for known drugs is important for the pharmaceutical industry. Many computational methods have been proposed for predicting unknown associations between drugs and target proteins associated with diseases. To produce better prediction, researchers should not only develop accurate algorithms but identify good features that reflect intracellular systems. In this paper, we proposed a novel method for exploiting protein localization. We generated localization vectors (LVs) from protein localization and propagated LVs through a protein interaction network to increase the coverage of the localization information. The LVs showed distinct patterns among targets of known drugs as well as independent characteristics compared to existing features. Based on the experimental results, we determined that including LVs improves cross-validation accuracy and, produces better novel predictions with real and independent clinical trial data. Moreover, the propagation of LVs showed a positive result that it can help in increasing the coverage of the prediction results.
Hispanic/Latino gastric adenocarcinoma patients have distinct molecular profiles including a high rate of germline CDH1 mutations
Background Drug sensitivity prediction and drug responsive biomarker selection on high-throughput genomic data is a critical step in drug discovery. Many computational methods have been developed to serve this purpose including several deep neural network models. However, the modular relations among genomic features have been largely ignored in these methods. To overcome this limitation, the role of the gene co-expression network on drug sensitivity prediction is investigated in this study. Methods In this paper, we first introduce a network-based method to identify representative features for drug response prediction by using the gene co-expression network. Then, two graph-based neural network models are proposed and both models integrate gene network information directly into neural network for outcome prediction. Next, we present a large-scale comparative study among the proposed network-based methods, canonical prediction algorithms (i.e., Elastic Net, Random Forest, Partial Least Squares Regression, and Support Vector Regression), and deep neural network models for drug sensitivity prediction. All the source code and processed datasets in this study are available at https://github.com/compbiolabucf/drug-sensitivity-prediction. Results In the comparison of different feature selection methods and prediction methods on a non-small cell lung cancer (NSCLC) cell line RNA-seq gene expression dataset with 50 different drug treatments, we found that (1) the network-based feature selection method improves the prediction performance compared to Pearson correlation coefficients; (2) Random Forest outperforms all the other canonical prediction algorithms and deep neural network models; (3) the proposed graph-based neural network models show better prediction performance compared to deep neural network model; (4) the prediction performance is drug dependent and it may relate to the drug’s mechanism of action. Conclusions Network-based feature selection method and prediction models improve the performance of the drug response prediction. The relations between the genomic features are more robust and stable compared to the correlation between each individual genomic feature and the drug response in high dimension and low sample size genomic datasets.
The aim of this study was to investigate the prognostic value of radiomics signatures derived from 18F-fluorodeoxyglucose (18F-FDG) positron-emission tomography (PET) in patients with colorectal cancer (CRC). From April 2008 to Jan 2014, we identified CRC patients who underwent 18F-FDG-PET before starting any neoadjuvant treatments and surgery. Radiomics features were extracted from the primary lesions identified on 18F-FDG-PET. Patients were divided into a training and validation set by random sampling. A least absolute shrinkage and selection operator Cox regression model was applied for prognostic signature building with progression-free survival (PFS) using the training set. Using the calculated radiomics score, a nomogram was developed, and its clinical utility was assessed in the validation set. A total of 381 patients with surgically resected CRC patients (training set: 228 vs. validation set: 153) were included. In the training set, a radiomics signature labeled as a rad_score was generated using two PET-derived features, such as gray-level run length matrix long-run emphasis (GLRLM_LRE) and gray-level zone length matrix short-zone low-gray-level emphasis (GLZLM_SZLGE). Patients with a high rad_score in the training and validation set had a shorter PFS. Multivariable analysis revealed that the rad_score was an independent prognostic factor in both training and validation sets. A radiomics nomogram, developed using rad_score, nodal stage, and lymphovascular invasion, showed good performance in the calibration curve and comparable predictive power with the staging system in the validation set. Textural features derived from 18F-FDG-PET images may enable detailed stratification of prognosis in patients with CRC.
Detecting protein complexes is one of essential and fundamental tasks in understanding various biological functions or processes. Therefore, precise identification of protein complexes is indispensible. For more precise detection of protein complexes, we propose a novel data structure which employs bottleneck proteins as partitioning points for detecting the protein complexes. The partitioning process allows overlapping between resulting protein complexes. We applied our algorithm to several PPI (Protein-Protein Interaction) networks of Saccharomyces cerevisiae and Homo sapiens, and validated our results using public databases of protein complexes. Our algorithm resulted in overlapping protein complexes with significantly improved F1 score, which comes from higher precision.
The introduction of insertion-deletions (INDELs) by activation of the error-prone non-homologous end-joining (NHEJ) pathway underlies the mechanistic basis of CRISPR/Cas9-directed genome editing. The ability of CRISPR/Cas9 to achieve gene elimination (knockouts) is largely attributed to the emergence of a pre-mature termination codon (PTC) from a frameshift-inducing INDEL that elicits non-sense mediated decay (NMD) of the mutant mRNA. Yet, the impact on gene expression as a consequence of CRISPR/Cas9-introduced INDELs into RNA regulatory sequences has been largely left uninvestigated. By tracking DNA-mRNA-protein relationships in a collection of CRISPR/Cas9-edited cell lines that harbor frameshift-inducing INDELs in various targeted genes, we detected the production of foreign mRNAs or proteins in ∼50% of the cell lines. We demonstrate that these aberrant protein products are derived from the introduction of INDELs that promote internal ribosomal entry, convert pseudo-mRNAs into protein encoding molecules, or induce exon skipping by disruption of exon splicing enhancers (ESEs). Our results using CRISPR/Cas9-introduced INDELs reveal facets of an epigenetic genome buffering apparatus that likely evolved to mitigate the impact of such mutations introduced by pathogens and aberrant DNA damage repair, and that more recently pose challenges to manipulating gene expression outcomes using INDEL-based mutagenesis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.