Biocomputing 2018 2017
DOI: 10.1142/9789813235533_0008
|View full text |Cite
|
Sign up to set email alerts
|

Extracting a biologically relevant latent space from cancer transcriptomes with variational autoencoders

Abstract: The Cancer Genome Atlas (TCGA) has profiled over 10,000 tumors across 33 different cancertypes for many genomic features, including gene expression levels. Gene expression measurements capture substantial information about the state of each tumor. Certain classes of deep neural network models are capable of learning a meaningful latent space. Such a latent space could be used to explore and generate hypothetical gene expression profiles under various types of molecular and genetic perturbation. For example, on… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

3
150
0
1

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
2
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 153 publications
(154 citation statements)
references
References 20 publications
3
150
0
1
Order By: Relevance
“…Autoencoders have been used for learning representations and analysing transcriptomic cancer data before. In particular, our work relates to Way and Greene (2018), since it employs VAEs for constructing latent representations and analysing transcriptomic cancer data. The authors show that VAEs can be utilised for knowledge extraction from gene expression pan-cancer TCGA data (TCGA et al, 2013), thus reducing the dimensionality of the single, homogeneous data source while still being able to identify patterns related to different cancer types.…”
Section: Discussionmentioning
confidence: 99%
“…Autoencoders have been used for learning representations and analysing transcriptomic cancer data before. In particular, our work relates to Way and Greene (2018), since it employs VAEs for constructing latent representations and analysing transcriptomic cancer data. The authors show that VAEs can be utilised for knowledge extraction from gene expression pan-cancer TCGA data (TCGA et al, 2013), thus reducing the dimensionality of the single, homogeneous data source while still being able to identify patterns related to different cancer types.…”
Section: Discussionmentioning
confidence: 99%
“…3) After training the second DAE, genes with high weights in this DAE are selected based on a standard deviation filter on their connectivity weights. This selection approach is similar to (Danaee et al, 2016;Tan et al, 2015Tan et al, , 2016Tan et al, , 2017Way and Greene, 2018). The idea of this gene selection step is to provide the classifier with a few rich genes with the strongest signals based on both the labelled and unlabelled datasets.…”
Section: Methodsmentioning
confidence: 99%
“…In the previous studies the value of this threshold for similar purposes was typically set to 2 (Danaee et al, 2016;Tan et al, 2015Tan et al, , 2016 or 2.5 (Tan et al, 2017;Way and Greene, 2018).…”
Section: Dhw Genesmentioning
confidence: 99%
See 1 more Smart Citation
“…While deep learning has been applied in many domains such as speech recognition, image recognition, natural language processing, its application in analyzing genomic data is very limited. Way et al 17 applied variational autoencoders (VAEs), an unsupervised deep neural network approach to analyze TCGA gene expression profiling data. Specifically, the extent to which a VAE can be trained to model cancer gene expression, and whether or not such a VAE would capture biologically relevant features were evaluated.…”
Section: Disease Genes and Pathwaysmentioning
confidence: 99%