Single-cell RNA sequencing (scRNA-seq) is increasingly used in studies on gastrointestinal cancers. This study investigated the prognostic value of epithelial cell-associated biomarkers in colorectal cancer (CRC) using scRNA-seq data. We downloaded and analysed scRNA-seq data from four CRC samples from the Gene Expression Omnibus (GEO), and we identified marker genes of malignant epithelial cells (MECs) using CRC transcriptome and clinical data downloaded from The Cancer Genome Atlas (TCGA) and GEO as training and validation cohorts, respectively. In the TCGA training cohort, weighted gene correlation network analysis, univariate Cox proportional hazard model (Cox) analysis, and least absolute shrinkage and selection operator regression analysis were performed on the marker genes of MEC subsets to identify a signature of nine prognostic MEC-related genes (MECRGs) and calculate a risk score based on the signature. CRC patients were divided into high- and low-risk groups according to the median risk score. We found that the MECRG risk score was significantly correlated with the clinical features and overall survival of CRC patients, and that CRC patients in the high-risk group showed a significantly shorter survival time. The univariate and multivariate Cox regression analyses showed that the MECRG risk score can serve as an independent prognostic factor for CRC patients. Gene set enrichment analysis revealed that the MECRG signature genes are involved in fatty acid metabolism, p53 signalling, and other pathways. To increase the clinical application value, we constructed a MECRG nomogram by combining the MECRG risk score with other independent prognostic factors. The validity of the nomogram is based on receiver operating characteristics and calibration curves. The MECRG signature and nomogram models were well validated in the GEO dataset. In conclusion, we established an epithelial cell marker gene-based risk assessment model based on scRNA-seq analysis of CRC samples for predicting the prognosis of CRC patients.
<abstract>
<p>Background: The epithelial-mesenchymal transition (EMT) is associated with gastric cancer (GC) progression and immune microenvironment. To better understand the heterogeneity underlying EMT, we integrated single-cell RNA-sequencing (scRNA-seq) data and bulk sequencing data from GC patients to evaluate the prognostic utility of biomarkers for EMT-related cells (ERCs), namely, cancer-associated fibroblasts (CAFs) and epithelial cells (ECs). Methods: scRNA-seq data from primary GC tumor samples were obtained from the Gene Expression Omnibus (GEO) database to identify ERC marker genes. Bulk GC datasets from the Cancer Genome Atlas (TCGA) and GEO were used as training and validation sets, respectively. Differentially expressed markers were identified from the TCGA database. Univariate Cox, least-absolute shrinkage, and selection operator regression analyses were performed to identify EMT-related cell-prognostic genes (ERCPGs). Kaplan-Meier, Cox regression, and receiver-operating characteristic (ROC) curve analyses were adopted to evaluate the prognostic utility of the ERCPG signature. An ERCPG-based nomogram was constructed by integrating independent prognostic factors. Finally, we evaluated the correlations between the ERCPG signature and immune-cell infiltration and verified the expression of ERCPG prognostic signature genes by in vitro cellular assays. Results: The ERCPG signature was comprised of seven genes (COL4A1, F2R, MMP11, CAV1, VCAN, FKBP10, and APOD). Patients were divided into high- and low-risk groups based on the ERCPG risk scores. Patients in the high-risk group showed a poor prognosis. ROC and calibration curves suggested that the ERCPG signature and nomogram had a good prognostic utility. An immune cell-infiltration analysis suggested that the abnormal expression of ERCPGs induced the formation of an unfavorable tumor immune microenvironment. In vitro cellular assays showed that ERCPGs were more abundantly expressed in GC cell lines compared to normal gastric tissue cell lines. Conclusions: We constructed and validated an ERCPG signature using scRNA-seq and bulk sequencing data from ERCs of GC patients. Our findings support the estimation of patient prognosis and tumor treatment in future clinical practice.</p>
</abstract>
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.