2019
DOI: 10.1101/855593
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Unsupervised Machine Learning for Data Encoding applied to Ovarian Cancer Transcriptomes

Abstract: Machine learning algorithms are revolutionising how information can be extracted from complex and highdimensional data sets via intelligent compression. For example, unsupervised Autoencoders train a deep neural network with a low-dimensional "bottlenecked" central layer to reconstruct input vectors. Variational Autoencoders (VAEs) have shown promise at learning meaningful latent spaces for text, image and more recently, gene-expression data. In the latter case they have been shown capable of capturing biologi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
7
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
2
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(7 citation statements)
references
References 30 publications
0
7
0
Order By: Relevance
“…There are many works which have used VAE in their studies. However, they are mostly mono-omics studies of individual cancer 28,32,33 or pancancer 29,34,35 . OmiVAE 29 is the only work that considered VAE for integrated multi-omics (di-omics) analysis of pancancer.…”
mentioning
confidence: 99%
See 1 more Smart Citation
“…There are many works which have used VAE in their studies. However, they are mostly mono-omics studies of individual cancer 28,32,33 or pancancer 29,34,35 . OmiVAE 29 is the only work that considered VAE for integrated multi-omics (di-omics) analysis of pancancer.…”
mentioning
confidence: 99%
“…Moreover, most of the existing works 28,32,33,35 use unsupervised dimensionality reduction methods, separating the downstream analysis from the reduction method. However, dimensionality reduction in cancer multi-omics analysis is an intermediate step toward the downstream analysis, like classification (e.g., cancer vs normal cell).…”
mentioning
confidence: 99%
“…Multiple studies (An and Cho, 2015; Li and She, 2017; Bouchacourt et al , 2017; Kipf and Welling, 2016) have established the power of the variational autoencoder (VAE; Kingma and Welling (2013); Jimenez Rezende et al (2014))—an unsupervised nonlinear data embedding model with two deep neural networks oppositely connected through a low-dimensional probabilistic latent space—for finding meaningful and useful latent features in high-dimensional data. In the context of cancer bioinformatics, VAEs have been variously used to (i) model cancer gene expression and capture biologically-relevant features using the TCGA Pan-cancer Project RNA-seq dataset (Way and Greene, 2018); (ii) find encodings that correlate with biological features such as patient sex and tumor type (Titus et al , 2018); (iii) find encodings that can be used to predict gene inactivation in cancer (Way and Greene, 2017); and (iv) obtain an encoding that is predictive of chemotherapy resistance (George and Lio, 2019). Based on their exploration of multiple VAE architectures for predicting gene inactivation in a pan-cancer dataset, Way & Greene reported (2017) biological insights obtained from the latent-space embeddings learned by VAEs.…”
Section: Introductionmentioning
confidence: 99%
“…Based on their exploration of multiple VAE architectures for predicting gene inactivation in a pan-cancer dataset, Way & Greene reported (2017) biological insights obtained from the latent-space embeddings learned by VAEs. George and Lio (2019) used a VAE-based, fully unsupervised approach to encode ovarian tumor transcriptomes and obtained latent-space features that were associated with response to chemotherapy. These studies suggest that a tumor transcriptome VAE may be broadly useful for the response-to-chemotherapy prediction problem and they set the stage for the present multi-cancer investigation of the utility of the tumor transcriptome VAE in precision oncology.…”
Section: Introductionmentioning
confidence: 99%
“…Way and Greene [ 28 ] explored VAE architectures for predicting gene inactivation in a pan-cancer dataset and reported biological insights obtained from the latent-space embeddings. George and Lio [ 29 ] used a VAE-based, unsupervised approach to encode tumor transcriptomes to obtain latent-space features associated with chemotherapy response. Dincer et al [ 30 ] used a semi-supervised, VAE-lasso approach to predict drug sensitivity of cancer cells in vitro.…”
Section: Introductionmentioning
confidence: 99%