2020
DOI: 10.1093/bioinformatics/btaa488
|View full text |Cite
|
Sign up to set email alerts
|

Factorized embeddings learns rich and biologically meaningful embedding spaces using factorized tensor decomposition

Abstract: Motivation The recent development of sequencing technologies revolutionized our understanding of the inner workings of the cell as well as the way disease is treated. A single RNA sequencing (RNA-Seq) experiment, however, measures tens of thousands of parameters simultaneously. While the results are information rich, data analysis provides a challenge. Dimensionality reduction methods help with this task by extracting patterns from the data by compressing it into compact vector representation… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
2

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 28 publications
0
4
0
Order By: Relevance
“…To cope with dataset-specific missing values, we modified the deep tensor factorisation model by Trofimov et al. ( 6 ). We further generated two embeddings from the functional PPI network STRING ( 23 ) using graph embedding algorithms (Materials and Methods).…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…To cope with dataset-specific missing values, we modified the deep tensor factorisation model by Trofimov et al. ( 6 ). We further generated two embeddings from the functional PPI network STRING ( 23 ) using graph embedding algorithms (Materials and Methods).…”
Section: Resultsmentioning
confidence: 99%
“…Studies have demonstrated that functional gene embeddings can be derived from a variety of data types reflecting gene function, including gene expression ( 4–6 ), CRISPR screens ( 7 ), protein sequence ( 8 ), protein–protein interaction networks ( 9–11 ), but also occurrences in scientific literature ( 12 ) and Gene Ontology ( 13 ) annotations ( 14 ). These studies have shown that the obtained embeddings have predictive power for a wide range of downstream prediction tasks including the prediction of disease-gene lists ( 9 , 12 ), Gene Ontology annotations ( 6 , 7 ), gene-phenotype associations ( 15 ), but also gene-gene interactions ( 5 , 11 , 14 ) and subcellular localizations of gene products ( 10 ). These results are appealing because they suggest that precomputed functional gene embeddings could become effective resources for integrating gene function information into machine learning models.…”
Section: Introductionmentioning
confidence: 99%
“…Plus, PCA performs linear transformations, which can be possibly insufficient to model data. Neural networks such as t-SNE (Hinton & Van Der Maaten, 2008) and Factorized Embedding (Trofimov et al, 2020) can learn useful non-linear data-driven representations. Adaptation of these DNNs in practice to learn survival-directed representations with a CPH could theoretically improve performance.…”
Section: Improvements To the Systemsmentioning
confidence: 99%
“…In bioinformatics, embeddings have been used to represent DNA nucleotide sequences [15][16][17][18] , protein sequences 19 , genes 20 , and single-cell Hi-C data 21 , among others. Factorized tensor decomposition has also been used to achieve biologically meaningful representation for RNA-Seq data 22 . Most similar to our use case is the recently published Avocado method, which learns a dense representation for a region using a deep neural network trained on epigenome signal information 23 .…”
Section: Introductionmentioning
confidence: 99%