2019
DOI: 10.1093/bioinformatics/btz769
|View full text |Cite
|
Sign up to set email alerts
|

Deep-learning approach to identifying cancer subtypes using high-dimensional genomic data

Abstract: Motivation Cancer subtype classification has the potential to significantly improve disease prognosis and develop individualized patient management. Existing methods are limited by their ability to handle extremely high-dimensional data and by the influence of misleading, irrelevant factors, resulting in ambiguous and overlapping subtypes. Results To address the above issues, we proposed a novel approach to disentangling and … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
58
0
1

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 101 publications
(70 citation statements)
references
References 38 publications
1
58
0
1
Order By: Relevance
“…The formulation and the architecture of the clustering network are built upon DeepType [25], which also uses representation, classification, and clustering modules. Different from DeepType, DeepCrossCancer’s clustering network contains an additional survival module.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The formulation and the architecture of the clustering network are built upon DeepType [25], which also uses representation, classification, and clustering modules. Different from DeepType, DeepCrossCancer’s clustering network contains an additional survival module.…”
Section: Methodsmentioning
confidence: 99%
“…We use the cross-entropy loss to quantify the discrepancy between the correct cancer type of the patient and the predicted cancer types of the patient. As in [25], we impose an £ 1 regularization [27] on the weight matrix of the first layer to control the model complexity. The cross-entropy and sparsity losses are defined as: …”
Section: Methodsmentioning
confidence: 99%
“…AEs have been also successfully applied on gene-expression or multi-omics data to tackle more complex and biologically-relevant cancer prediction tasks, such as classifying cancer sub-types or characterizing functional gene profiles [22][23][24]. Feed-forward MLNNs have been recently utilized to predict clinical outcomes from high-dimensional genomic data [25] or to identify cancer sub-types by combining supervised and unsupervised learning [26].…”
Section: Plos Onementioning
confidence: 99%
“…On these data sets, we compared ClusterATAC with four state-ofart clustering algorithms: K-means, spectral clustering (Spectral), autoencoder (AE), variational autoencoder (VAE). K-means and Spectral are the two most stable and effective clustering algorithms, while AE and VAE are two deep learning algorithms applied to omics data clustering [29][30][31]. Especially, VAE has been proven by previous work to handle high-dimensional ATAC-seq data [20].…”
Section: Evaluate the Performance Of Clusteratac On Benchmark Data Setsmentioning
confidence: 99%