2015
DOI: 10.1101/030650
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ADAGE analysis of publicly available gene expression data collections illuminatesPseudomonas aeruginosa-hostinteractions

Abstract: 16The growth in genome-scale assays of gene expression for different species in publicly available 17 databases presents new opportunities for computational methods that aid in hypothesis 18 generation and biological interpretation of these data. Here, we present an unsupervised 19 machine-learning approach, ADAGE (Analysis using Denoising Autoencoders of Gene 20 Expression) and apply it to the interpretation of all of the publicly available gene expression data 21for Pseudomonas aeruginosa, an important oppor… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 68 publications
(62 reference statements)
0
5
0
Order By: Relevance
“…Dimension reduction methods like principal component analysis (PCA), diffusion maps or t-distributed stochastic neighbor embedding (tSNE) are commonly used to visualize the manifold for gene expression data 19,20 . A number of recent studies describe applications of autoencoders in genomics [21][22][23][24][25] . During denoising, the autoencoder learns the manifold and removes the noise by moving corrupted data points onto the manifold 26 (Fig.…”
Section: Introductionmentioning
confidence: 99%
“…Dimension reduction methods like principal component analysis (PCA), diffusion maps or t-distributed stochastic neighbor embedding (tSNE) are commonly used to visualize the manifold for gene expression data 19,20 . A number of recent studies describe applications of autoencoders in genomics [21][22][23][24][25] . During denoising, the autoencoder learns the manifold and removes the noise by moving corrupted data points onto the manifold 26 (Fig.…”
Section: Introductionmentioning
confidence: 99%
“…Feature Selection: The “curse of dimensionality” is a typical problem when using multimodal data (Tan et al, 2015 ; Nguyen et al, 2020 ). For example, the gene expression data and CNA data in METABRIC dataset contain 24,369 and 22,545 genes, respectively.…”
Section: Methodsmentioning
confidence: 99%
“…Authors in [28] present a Multimodal Deep Neural Network by integrating Multi-dimensional Data (MDNNMD) for diagnosing breast cancer from multidimensional data i.e., gene expression profile data and Copy Number Alteration (CNA) profile data. The small sample size or high dimensionality data may cause bad results [29]. Initially, they select effective features from gene expression profile data that include approximately 24,000 genes and CNA profile data that include approximately 26,000 genes using mRMR method [30].…”
Section: Literature Reviewmentioning
confidence: 99%