2016
DOI: 10.1155/2016/1035945
|View full text |Cite
|
Sign up to set email alerts
|

Modeling Gene Regulation in Liver Hepatocellular Carcinoma with Random Forests

Abstract: Liver hepatocellular carcinoma (HCC) remains a leading cause of cancer-related death. Poor understanding of the mechanisms underlying HCC prevents early detection and leads to high mortality. We developed a random forest model that incorporates copy-number variation, DNA methylation, transcription factor, and microRNA binding information as features to predict gene expression in HCC. Our model achieved a highly significant correlation between predicted and measured expression of held-out genes. Furthermore, we… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(3 citation statements)
references
References 25 publications
0
3
0
Order By: Relevance
“…This further supports the notion that the deregulation of the CRL2 pVHL complex in HCC is not due to a single common cause, but rather several causes that can compound to affect patient outcomes. The multimodal nature of this regulation has also been observed in HCC more generally, where models that incorporate data on CN variations, hypomethylation, and miRNA expression more accurately predict mRNA expression than those that discard any individual mode ( Kazan, 2016 ). Similar forms of multimodal deregulation have also been observed in oncogenes outside of the HCC context, including EGFR , MDM2 , and PDGFRA ( Louhimo and Hautaniemi, 2011 ), as well as other UPP genes ( Ge et al, 2018 ).…”
Section: Discussionmentioning
confidence: 92%
“…This further supports the notion that the deregulation of the CRL2 pVHL complex in HCC is not due to a single common cause, but rather several causes that can compound to affect patient outcomes. The multimodal nature of this regulation has also been observed in HCC more generally, where models that incorporate data on CN variations, hypomethylation, and miRNA expression more accurately predict mRNA expression than those that discard any individual mode ( Kazan, 2016 ). Similar forms of multimodal deregulation have also been observed in oncogenes outside of the HCC context, including EGFR , MDM2 , and PDGFRA ( Louhimo and Hautaniemi, 2011 ), as well as other UPP genes ( Ge et al, 2018 ).…”
Section: Discussionmentioning
confidence: 92%
“…The random forest (RF) algorithm is an ensemble machine learning method developed by Breiman [ 24 ]. It has been widely applied to prediction problems in bioinformatics [ 25 27 ]. The RF algorithm consists of multibase tree-structured classifiers such as CART (classification and regression tree), and it is robust to noise, is not hindered by overfitting, and is computationally feasible.…”
Section: Methodsmentioning
confidence: 99%
“…the human gut microbiome and detection of cancers such as ovarian, lung and breast 22 , 23 . Although random forest (RF) has been useful in selection of genetic features for HCC detection 24 , to our knowledge, RF has not yet been used to build an HCC screening or diagnosis test using multiple biomarkers. Because of RF’s strength in biological applications of machine learning (e.g.…”
Section: Introductionmentioning
confidence: 99%