2013
DOI: 10.1214/12-aoas578
|View full text |Cite
|
Sign up to set email alerts
|

Sparse integrative clustering of multiple omics data sets

Abstract: High resolution microarrays and second-generation sequencing platforms are powerful tools to investigate genome-wide alterations in DNA copy number, methylation, and gene expression associated with a disease. An integrated genomic profiling approach measuring multiple omics data types simultaneously in the same set of biological samples would render an integrated data resolution that would not be available with any single data type. In this study, we use penalized latent variable regression methods for joint m… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
89
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 96 publications
(91 citation statements)
references
References 55 publications
(78 reference statements)
0
89
0
Order By: Relevance
“…We build up five numerical examples based on those in [4, 7]. In these examples, for each type of data, clustering information is associated with a small number of functional variants (viewed as features), based on which one cluster can be distinguished from the other two, while the other two are still inseparable.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…We build up five numerical examples based on those in [4, 7]. In these examples, for each type of data, clustering information is associated with a small number of functional variants (viewed as features), based on which one cluster can be distinguished from the other two, while the other two are still inseparable.…”
Section: Resultsmentioning
confidence: 99%
“…It is known that these genome-wide alterations are related to tumorigenesis and treatment response in a complex way [2]. As a result, integrative analysis of these alterations may lead to greater power in detecting tumor subtypes with important biological therapeutic differences [3, 4], as opposed to separate analysis on each data type, followed by a manual or ad hoc integration; this point has been observed in many studies [4, 5, 6, 7, 8]. …”
Section: Introductionmentioning
confidence: 99%
“…Lanckriet et al (2004) propose a two stage approach, first computing a kernel representation for data in each platform and subsequently combining kernels across platforms in a classification model. Mo et al (2013) and Shen et al (2013) proposed a clustering model “iCluster”, which uses a joint latent variable model to cluster samples into tumor subtypes. Through applications to breast and lung cancer data, iCluster identified potential novel tumor subtypes Similarly, Lock et al (2013) proposed an additive decomposition of variation approach consisting of low-rank approximations capturing joint variation across and within platforms, while using orthogonality constraints to ensure that patterns within and across platforms are unrelated.…”
Section: Structure Learning and Integrationmentioning
confidence: 99%
“…Several R/Bioconductor packages have implemented methods to integrate and visualize biological data: PMA [20–22], mixOmics [23–25], made4 [26, 27], RGCCA [28, 29], omicade4 [30, 31], CNAmet [32, 33], RTopper [34, 35], iClusterPlus [36, 37] and STATegRa [38] among others. Each of these packages implements a different strategy to face the integration analysis.…”
Section: Introductionmentioning
confidence: 99%