2019
DOI: 10.3389/fgene.2019.00744
|View full text |Cite
|
Sign up to set email alerts
|

Multi-view Subspace Clustering Analysis for Aggregating Multiple Heterogeneous Omics Data

Abstract: Integration of distinct biological data types could provide a comprehensive view of biological processes or complex diseases. The combinations of molecules responsible for different phenotypes form multiple embedded (expression) subspaces, thus identifying the intrinsic data structure is challenging by regular integration methods. In this paper, we propose a novel framework of “Multi-view Subspace Clustering Analysis (MSCA),” which could measure the local similarities of samples in the same subspace and obtain… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
15
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 14 publications
(15 citation statements)
references
References 35 publications
0
15
0
Order By: Relevance
“…As deep learning techniques have been widely used in the field of bioinformatics and achieved many successes, we added a deep learning-based method, Subtype-GAN [95], into our experiment. We notice that LRAcluster, PFA, and Mul-tiNMF are not originally designed for cancer subtyping problems but these methods represent general frameworks for integrating multi-omics data, which can be used to conduct different downstream analysis including cancer subtyping and have good performances [1,11,12,30]. For evaluation and comparison of more integration methods, we included these three methods in this study.…”
Section: Selection Of Integration Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…As deep learning techniques have been widely used in the field of bioinformatics and achieved many successes, we added a deep learning-based method, Subtype-GAN [95], into our experiment. We notice that LRAcluster, PFA, and Mul-tiNMF are not originally designed for cancer subtyping problems but these methods represent general frameworks for integrating multi-omics data, which can be used to conduct different downstream analysis including cancer subtyping and have good performances [1,11,12,30]. For evaluation and comparison of more integration methods, we included these three methods in this study.…”
Section: Selection Of Integration Methodsmentioning
confidence: 99%
“…To understand and demonstrate the crucial need for addressing the second problem of data type selection, we surveyed 58 integration methods for cancer subtyping proposed from 2009 to 2019, and the result is summarized in Fig 1 where gene expression is treated as the same as mRNA expression and miRNA expression is placed into the group of epigenome based on observations from [7]. We summarized part of these 58 integration methods with the omics data they used in Fig 1A , and we can see, the data combinations used in these methods [2,[8][9][10][11][12][13][14][15][16][17][18][19][20][21][22][23][24] are significantly inconsistent. For example, Fig 1B shows that while the mRNA expression data were used by 56 of the 58 methods, each of the other data types was only used by at most nearly half of these methods.…”
Section: Introductionmentioning
confidence: 99%
“…It is possible to identify the cell type from the whole gene regulatory network perspective (Li et al, 2017;Zheng et al, 2018Zheng et al, , 2019a. Besides, motivated by previous studies (Lan et al, 2018;Chen et al, 2019;Shi et al, 2019), multi-view learning and integrating with prior knowledge are promising directions to improve the accuracy of clustering and give a higher resolution of cell types.…”
Section: Discussionmentioning
confidence: 99%
“…However, these methods may further dilute the already low signal-to-noise ratio and increase the noise pollution to the results. Considering that the sample (patient) size of the biological data is much smaller than the feature (gene) size, some graph-based learning methods for cancer subtype recognition are designed [ 11 , 12 , 13 , 14 , 15 , 16 , 17 ]. These methods use the sample points to quickly construct the similarity graph, which can be converted into the problem of spectral clustering.…”
Section: Introductionmentioning
confidence: 99%