2014
DOI: 10.1073/pnas.1417808111
|View full text |Cite
|
Sign up to set email alerts
|

Learning regulatory programs by threshold SVD regression

Abstract: We formulate a statistical model for the regulation of global gene expression by multiple regulatory programs and propose a thresholding singular value decomposition (T-SVD) regression method for learning such a model from data. Extensive simulations demonstrate that this method offers improved computational speed and higher sensitivity and specificity over competing approaches. The method is used to analyze microRNA (miRNA) and long noncoding RNA (lncRNA) data from The Cancer Genome Atlas (TCGA) consortium. T… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
24
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 26 publications
(24 citation statements)
references
References 35 publications
(34 reference statements)
0
24
0
Order By: Relevance
“…Meanwhile, the TCGA project generated datasets for over 18 cancer types, which include gene expression, copy number alteration (CNA), DNA methylation, and somatic mutation profiles (10). All of these resources provided a rich base for cancer integrative analysis (11,12).…”
Section: Rna-binding Proteinmentioning
confidence: 99%
See 1 more Smart Citation
“…Meanwhile, the TCGA project generated datasets for over 18 cancer types, which include gene expression, copy number alteration (CNA), DNA methylation, and somatic mutation profiles (10). All of these resources provided a rich base for cancer integrative analysis (11,12).…”
Section: Rna-binding Proteinmentioning
confidence: 99%
“…Meanwhile, the TCGA project generated datasets for over 18 cancer types, which include gene expression, copy number alteration (CNA), DNA methylation, and somatic mutation profiles (10). All of these resources provided a rich base for cancer integrative analysis (11,12).Despite the rapid growth of genomic data, the knowledge on how gene expression programs in tumors are controlled by TFs is still limited. As one challenge, the experimental condition of public ChIP-seq data, such as stem cell line, may not match the physiological condition of a specific cancer type.…”
mentioning
confidence: 99%
“…Recently, we casted the matched data integration into a multivariate regression framework and proposed a new method, T-SVD [43]. The application example is to analyze miRNA and lncRNA data from The Cancer Genome Atlas (TCGA) consortium.…”
Section: Models and Algorithms For Matched Samplementioning
confidence: 99%
“…The reduced-rank representation has been further extended to enable variable selection [11, 6, 14], and it can be distilled from many other multivariate tools such as PCA, canonical correlation analysis, and matrix completion [9, 27]. It is evident that the reduced-rank methods have greatly facilitated scientific investigations in various disciplines, e.g., finance [39], ecology [12], neuroscience [55], genetics [31], etc.…”
Section: Introductionmentioning
confidence: 99%