High-risk neuroblastoma is a very aggressive disease, with excessive tumor growth and poor outcomes. A proper stratification of the high-risk patients by prognostic outcome is important for treatment. However, there is still a lack of survival stratification for the high-risk neuroblastoma. To fill the gap, we adopt a deep learning algorithm, Autoencoder, to integrate multi-omics data, and combine it with K-means clustering to identify two subtypes with significant survival differences. By comparing the Autoencoder with PCA, iCluster, and DGscore about the classification based on multi-omics data integration, Autoencoder-based classification outperforms the alternative approaches. Furthermore, we also validated the classification in two independent datasets by training machine-learning classification models, and confirmed its robustness. Functional analysis revealed that MYCN amplification was more frequently occurred in the ultra-high-risk subtype, in accordance with the overexpression of MYC/MYCN targets in this subtype. In summary, prognostic subtypes identified by deep learning-based multi-omics integration could not only improve our understanding of molecular mechanism, but also help the clinicians make decisions.
Background Gene copy number variations (CNVs) contribute to genetic diversity and disease prevalence across populations. Substantial efforts have been made to decipher the relationship between CNVs and pathogenesis but with limited success. Results We have developed a novel computational framework X-CNV (www.unimd.org/XCNV), to predict the pathogenicity of CNVs by integrating more than 30 informative features such as allele frequency (AF), CNV length, CNV type, and some deleterious scores. Notably, over 14 million CNVs across various ethnic groups, covering nearly 93% of the human genome, were unified to calculate the AF. X-CNV, which yielded area under curve (AUC) values of 0.96 and 0.94 in training and validation sets, was demonstrated to outperform other available tools in terms of CNV pathogenicity prediction. A meta-voting prediction (MVP) score was developed to quantitively measure the pathogenic effect, which is based on the probabilistic value generated from the XGBoost algorithm. The proposed MVP score demonstrated a high discriminative power in determining pathogenetic CNVs for inherited traits/diseases in different ethnic groups. Conclusions The ability of the X-CNV framework to quantitatively prioritize functional, deleterious, and disease-causing CNV on a genome-wide basis outperformed current CNV-annotation tools and will have broad utility in population genetics, disease-association studies, and diagnostic screening.
Background Glioma is the most commonly diagnosed malignant and aggressive brain cancer in adults. Traditional researches mainly explored the expression profile of glioma at cell-population level, but ignored the heterogeneity and interactions of among glioma cells. Methods Here, we firstly analyzed the single-cell RNA-seq (scRNA-seq) data of 6341 glioma cells using manifold learning and identified neoplastic and healthy cells infiltrating in tumor microenvironment. We systematically revealed cell-to-cell interactions inside gliomas based on corresponding scRNA-seq and TCGA RNA-seq data. Results A total of 16 significantly correlated autocrine ligand-receptor signal pairs inside neoplastic cells were identified based on the scRNA-seq and TCGA data of glioma. Furthermore, we explored the intercellular communications between cancer stem-like cells (CSCs) and macrophages, and identified 66 ligand-receptor pairs, some of which could significantly affect prognostic outcomes. An efficient machine learning model was constructed to accurately predict the prognosis of glioma patients based on the ligand-receptor interactions. Conclusion Collectively, our study not only reveals functionally important cell-to-cell interactions inside glioma, but also detects potentially prognostic markers for predicting the survival of glioma patients. Electronic supplementary material The online version of this article (10.1186/s12964-019-0363-1) contains supplementary material, which is available to authorized users.
Next-generation sequencing (NGS) is being used in clinical testing. Government authorities in both China and the United States are overseeing the clinical application of NGS instruments and reagents. In addition, the US Association for Molecular Pathology and the College of American Pathologists have jointly released a guidance to standardize the analysis and interpretation of NGS data involved in clinical testing. At present, the analysis strategies and pipelines for NGS data related to the clinical detection of pediatric disease are similar to those used for adult diseases. However, for rare pediatric diseases without linkage to known genetic variants, it is currently difficult to detect the relevant pathogenic genes using NGS technology. Additionally, it is challenging to identify novel pathogenic genes of familial pediatric tumors. Therefore, characterization of the pathogenic genes associated with above diseases is important for the diagnosis and treatment of rare diseases in children. This article introduces the general pipelines for NGS data analyses of diseases and elucidates data analysis strategies for the pathogenic genes of rare pediatric diseases and familial pediatric tumors.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with đź’™ for researchers
Part of the Research Solutions Family.