Linear functional organization of the omic embedding space

Xenos, Alexandros; Malod-Dognin, Noël; Milinković, S.A.; Pržulj, Nataša

doi:10.1093/bioinformatics/btab487

Cited by 4 publications

(5 citation statements)

References 41 publications

“…Following Xenos et al. [43] and Doria-Belenguer et al. [25], we use the DeepWalk closed formula by Qiu et.…”

Section: Methods (mentioning)

confidence: 91%

“…This formula can be interpreted as a diffusion process that captures higher-order proximities between the nodes in the network; hence, the PPMI matrix is a richer representation than the adjacency matrix [43]. As demonstrated by Xenos et al [43] and Doria-Belenguer et al [25], the extra information encoded in PPMI matrices leads to embedding spaces that better functionally organize the vectorial representation of both genes and gene functions than those generated by using the adjacency matrix.…”

Section: Methods (mentioning)

confidence: 99%

The axes of biology: a novel axes-based network embedding paradigm to decipher the functional mechanisms of the cell

Doria-Belenguer, Xenos, Ceddia, et al. 2023

Preprint

Common approaches for deciphering biological networks involve network embedding algorithms. These approaches strictly focus on clustering the genes’ embedding vectors and interpreting such clusters to reveal the hidden information of the networks. However, the difficulty in interpreting the genes’ clusters and the limitations of the functional annotations’ resources hinder the identification of the currently unknown cell’s functioning mechanisms. Thus, we propose a new approach that shifts this functional exploration from the embedding vectors of genes in space to the axes of the space itself. Our methodology better disentangles biological information from the embedding space than the classic gene-centric approach. Moreover, it uncovers new data-driven functional interactions that are unregistered in the functional ontologies, but biologically coherent. Furthermore, we exploit these interactions to define new higher-level annotations that we term Axes-Specific Functional Annotations and validate them through literature curation. Finally, we leverage our methodology to discover evolutionary connections between cellular functions and the evolution of species.

“…We represent the tissue-specific PPI networks with their positive point-wise mutual information (PPMI) matrices, X, where each entry in the matrix contains information about how frequently two nodes co-occur in a random walk in the corresponding PPI network. Following Xenos et al (2021), we use the DeepWalk closed formula by Perozzi et al (2014) with its default settings, which uses 10 iterations, to compute the PPMI matrix. This formula can be interpreted as a diffusion process that captures high-order proximities between the nodes in the network; hence, PPMI is a richer representation than the adjacency matrix (Xenos et al 2021).…”

Section: Methods (mentioning)

confidence: 99%

“…Following Xenos et al (2021), we use the DeepWalk closed formula by Perozzi et al (2014) with its default settings, which uses 10 iterations, to compute the PPMI matrix. This formula can be interpreted as a diffusion process that captures high-order proximities between the nodes in the network; hence, PPMI is a richer representation than the adjacency matrix (Xenos et al 2021). As a result of the extra information encoded in the PPMI, its corresponding embedding spaces better capture the functional organization of the cell than the ones generated by using the adjacency matrix (the details of this comparison are presented in Supplementary Section S1.2.1).…”

Section: Methods (mentioning)

confidence: 99%
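
The PPMI computation described in the two statements above can be illustrated with a short sketch. It is not the cited authors' code. It assumes the closed-form DeepWalk matrix as commonly attributed to Qiu et al., M = vol(G) / (b T) * (sum over r = 1..T of (D^-1 A)^r) D^-1, followed by PPMI = max(log M, 0). The function name deepwalk_ppmi, the reading of "10 iterations" as a window size T = 10, and the choice of b = 1 negative sample are assumptions made for illustration only.

```python
import numpy as np


def deepwalk_ppmi(adj, window=10, neg=1.0):
    """Closed-form DeepWalk PPMI matrix of an undirected network (sketch).

    adj    : symmetric (n, n) adjacency matrix with no isolated nodes
    window : random-walk window size T (assumed to be the "10 iterations")
    neg    : number of negative samples b (assumed to be 1)
    """
    adj = np.asarray(adj, dtype=float)
    deg = adj.sum(axis=1)
    if np.any(deg == 0):
        raise ValueError("isolated nodes have no transition probabilities")

    vol = deg.sum()                    # vol(G): sum of all node degrees
    transition = adj / deg[:, None]    # D^-1 A, one random-walk step

    # Accumulate P^1 + P^2 + ... + P^T, the diffusion over 1..T steps.
    power = np.eye(adj.shape[0])
    acc = np.zeros_like(adj)
    for _ in range(window):
        power = power @ transition
        acc += power

    # Right-multiply by D^-1 (divide column j by deg[j]) and rescale.
    m = (vol / (neg * window)) * acc / deg[None, :]

    # Zero entries give log(0) = -inf, which the max() below clips to 0.
    with np.errstate(divide="ignore"):
        log_m = np.log(m)
    return np.maximum(log_m, 0.0)


# Hypothetical toy network: a path on three nodes, 0 - 1 - 2.
adj = np.array([[0, 1, 0],
                [1, 0, 1],
                [0, 1, 0]], dtype=float)
ppmi = deepwalk_ppmi(adj, window=10)
print(ppmi)
```

With window=1 the matrix reduces to a rescaled, degree-normalized adjacency matrix, so only direct neighbours receive non-zero values. Larger windows let nodes that are several steps apart contribute, which is the "higher-order proximity" the statements refer to.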

“…One of the advantages of NMTF over deep neural network-based ML approaches is that it requires way fewer parameters to tune, thanks to the careful modeling of the relationships between the data points that it takes as input. As shown by Xenos et al (2021), the molecular network embedding space produced by NMTF can have valuable properties, e.g. orthonormality, that may lead to an easier interpretation and deeper scientific insight (Isokääntä et al 2020).…”

Section: Introduction (mentioning)

confidence: 99%

A functional analysis of omic network embedding spaces reveals key altered functions in cancer

et al. 2023

Self Cite

Motivation: Advances in omics technologies have revolutionized cancer research by producing massive datasets. Common approaches to deciphering these complex data are by embedding algorithms of molecular interaction networks. These algorithms find a low-dimensional space in which similarities between the network nodes are best preserved. Currently available embedding approaches mine the gene embeddings directly to uncover new cancer-related knowledge. However, these gene-centric approaches produce incomplete knowledge, since they do not account for the functional implications of genomic alterations. We propose a new, function-centric perspective and approach, to complement the knowledge obtained from omic data. Results: We introduce our Functional Mapping Matrix (FMM) to explore the functional organization of different tissue-specific and species-specific embedding spaces generated by a Non-negative Matrix Tri-Factorization algorithm. Also, we use our FMM to define the optimal dimensionality of these molecular interaction network embedding spaces. For this optimal dimensionality, we compare the FMMs of the most prevalent cancers in human to FMMs of their corresponding control tissues. We find that cancer alters the positions in the embedding space of cancer-related functions, while it keeps the positions of the non-cancer-related ones. We exploit this spatial “movement” to predict novel cancer-related functions. Finally, we predict novel cancer-related genes that the currently available methods for gene-centric analyses cannot identify; we validate these predictions by literature curation and retrospective analyses of patient survival data. Availability: Data and source code can be accessed at https://github.com/gaiac/FMM Supplementary information: Supplementary data are available at Bioinformatics online.

The axes of biology: a novel axes-based network embedding paradigm to decipher the functional mechanisms of the cell

Doria-Belenguer, Xenos, Ceddia, et al. 2024

Bioinformatics Advances

Common approaches for deciphering biological networks involve network embedding algorithms. These approaches strictly focus on clustering the genes’ embedding vectors and interpreting such clusters to reveal the hidden information of the networks. However, the difficulty in interpreting the genes’ clusters and the limitations of the functional annotations’ resources hinder the identification of the currently unknown cell’s functioning mechanisms. We propose a new approach that shifts this functional exploration from the embedding vectors of genes in space to the axes of the space itself. Our methodology better disentangles biological information from the embedding space than the classic gene-centric approach. Moreover, it uncovers new data-driven functional interactions that are unregistered in the functional ontologies, but biologically coherent. Furthermore, we exploit these interactions to define new higher-level annotations that we term Axes-Specific Functional Annotations and validate them through literature curation. Finally, we leverage our methodology to discover evolutionary connections between cellular functions and the evolution of species. Availability: Data and source code can be accessed at https://gitlab.bsc.es/sdoria/axes-of-biology.git Supplementary information: Supplementary data are available online.
