2017
DOI: 10.1097/mib.0000000000001222
|View full text |Cite
|
Sign up to set email alerts
|

Machine Learning–Based Gene Prioritization Identifies Novel Candidate Risk Genes for Inflammatory Bowel Disease

Abstract: Our method successfully differentiated IBD-risk genes from non-IBD genes by using information from expression data and a multitude of gene annotations. Crucial features were defined, and we were able to detect novel candidate risk genes for IBD. These findings may help detect new IBD-risk genes and improve the understanding of IBD pathogenesis.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
48
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 56 publications
(48 citation statements)
references
References 36 publications
0
48
0
Order By: Relevance
“…In a logistic regression, it is typical to apply a regularization terme.g., L1 (the sum of the absolute value of feature weights) and L2 (the sum of squared feature weights) -that introduce some bias while reducing variance, thereby improving predictive ability (Demir-Kavuk et al, 2011). Isakov et al (2017) used elastic net logistic regression (Zou and Hastie, 2005) which combines L1 and L2 penalties to prioritize IBD genes. This method performs both variable selection (L1), and shrinks coefficient sizes to reduce variance (L2) (Ogutu et al, 2012).…”
Section: Machine Learning Modelsmentioning
confidence: 99%
See 3 more Smart Citations
“…In a logistic regression, it is typical to apply a regularization terme.g., L1 (the sum of the absolute value of feature weights) and L2 (the sum of squared feature weights) -that introduce some bias while reducing variance, thereby improving predictive ability (Demir-Kavuk et al, 2011). Isakov et al (2017) used elastic net logistic regression (Zou and Hastie, 2005) which combines L1 and L2 penalties to prioritize IBD genes. This method performs both variable selection (L1), and shrinks coefficient sizes to reduce variance (L2) (Ogutu et al, 2012).…”
Section: Machine Learning Modelsmentioning
confidence: 99%
“…Regularized logistic regression with elastic net aims to minimize the "curse of dimensionality"where data has a larger number of features than samples -which is a particular blight on GWAS. For example, Isakov et al (2017) used data consisting of 314 positive genes and 1,736 negative genes each annotated with 1,027 features. By applying logistic regression with elastic net they could then select the best data for their models (309 features selected which are predominantly from biological ontologies).…”
Section: Machine Learning Modelsmentioning
confidence: 99%
See 2 more Smart Citations
“…We noticed that some of the non-stricturing CD patients also had high circulating elafin levels, leading to moderate accuracy when elafin alone was used in identifying stricturing CD patients. Elafin alone is not enough to indicate intestinal strictures accurately because the complexity of many clinical characteristics of the patients has not been considered (Isakov, Dotan et al, 2017, Waljee, Lipson et al, 2017.…”
Section: Discussionmentioning
confidence: 99%