Machine Learning–Based Gene Prioritization Identifies Novel Candidate Risk Genes for Inflammatory Bowel Disease

Isakov, Ofer; Dotan, Iris; Ben‐Shachar, Shay

doi:10.1097/mib.0000000000001222

Cited by 56 publications

(48 citation statements)

References 36 publications

“…In a logistic regression, it is typical to apply a regularization term, e.g., L1 (the sum of the absolute value of feature weights) and L2 (the sum of squared feature weights), that introduce some bias while reducing variance, thereby improving predictive ability (Demir-Kavuk et al, 2011). Isakov et al (2017) used elastic net logistic regression (Zou and Hastie, 2005) which combines L1 and L2 penalties to prioritize IBD genes. This method performs both variable selection (L1), and shrinks coefficient sizes to reduce variance (L2) (Ogutu et al, 2012).…”

Section: Machine Learning Models (mentioning)

confidence: 99%
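
The penalty described in the statement above can be written out in a few lines. This is an illustrative sketch only, not code from Isakov et al (2017). It assumes the glmnet-style parametrization of the elastic net, in which a mixing weight trades the L1 term against half the L2 term and a single factor sets the overall strength. The function name elastic_net_penalty and the parameter names lam and l1_ratio are hypothetical.

```python
def elastic_net_penalty(weights, lam, l1_ratio):
    """Elastic net penalty for a list of feature weights.

    Assumed parametrization (glmnet-style, not taken from the cited papers):
        lam * (l1_ratio * sum|w| + (1 - l1_ratio) / 2 * sum w^2)
    """
    l1 = sum(abs(w) for w in weights)   # L1: sum of absolute feature weights
    l2 = sum(w * w for w in weights)    # L2: sum of squared feature weights
    return lam * (l1_ratio * l1 + (1.0 - l1_ratio) / 2.0 * l2)


# l1_ratio = 1 gives a pure L1 (lasso) penalty, l1_ratio = 0 a pure L2 (ridge) penalty.
example_weights = [1.0, -2.0, 0.0]
print(elastic_net_penalty(example_weights, lam=1.0, l1_ratio=0.5))
```

In a penalized logistic regression this quantity is added to the classification loss, so larger weights cost more and the fit is pulled toward smaller, sparser coefficient vectors.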

“…Regularized logistic regression with elastic net aims to minimize the "curse of dimensionality", where data has a larger number of features than samples, which is a particular blight on GWAS. For example, Isakov et al (2017) used data consisting of 314 positive genes and 1,736 negative genes each annotated with 1,027 features. By applying logistic regression with elastic net they could then select the best data for their models (309 features selected which are predominantly from biological ontologies).…”

Section: Machine Learning Models (mentioning)

confidence: 99%
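
How an elastic net fit ends up selecting a subset of features, as in the statement above, can be shown on toy data. The sketch below is not the authors' pipeline and does not use their gene set: it generates 400 synthetic samples with 6 features, of which only feature 0 carries signal, and fits the model with proximal gradient descent (a technique chosen here for brevity, not one named in the source). All function names, parameter values, and the data itself are assumptions made for illustration.

```python
import math
import random


def soft_threshold(value, threshold):
    """Proximal step for the L1 term: shrink toward zero, setting small values to exactly 0."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def fit_elastic_net_logistic(X, y, lam=0.3, l1_ratio=0.5, lr=0.5, n_iter=300):
    """Proximal gradient descent for logistic regression with an elastic net penalty.

    No intercept is fitted, to keep the sketch short.
    """
    n, d = len(X), len(X[0])
    w = [0.0] * d
    for _ in range(n_iter):
        # Gradient of the smooth L2 part of the penalty.
        grad = [lam * (1.0 - l1_ratio) * wj for wj in w]
        # Add the gradient of the mean logistic loss.
        for xi, yi in zip(X, y):
            z = sum(wj * xj for wj, xj in zip(w, xi))
            p = 1.0 / (1.0 + math.exp(-z))
            for j in range(d):
                grad[j] += (p - yi) * xi[j] / n
        # Gradient step, then soft-thresholding to handle the L1 part.
        w = [soft_threshold(wj - lr * gj, lr * lam * l1_ratio) for wj, gj in zip(w, grad)]
    return w


# Synthetic data (hypothetical): the label depends on feature 0 only.
rng = random.Random(0)
X = [[rng.gauss(0.0, 1.0) for _ in range(6)] for _ in range(400)]
y = [1 if row[0] > 0 else 0 for row in X]

weights = fit_elastic_net_logistic(X, y)
selected = [j for j, wj in enumerate(weights) if wj != 0.0]
print("selected feature indices:", selected)
```

Features whose coefficient is exactly zero after fitting are dropped from the model. This is the sense in which the L1 part performs variable selection, while the L2 part keeps the surviving coefficients small.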

“…They found that logistic regression was the highest performing ML model, emphasizing that a classification problem may require simpler solutions. Besides ensemble learning and logistic regression, SVM is also consistently used within studies performing benchmark comparisons (Roshan et al, 2011; Isakov et al, 2017; Maciukiewicz et al, 2018; Vitsios and Petrovski, 2019). SVM aims to plot a decision boundary between groups by measuring hyperplanes, based on the distances between the most extreme samples of each classification group (Smola and Scholkopf, 2004; Figure 2).…”

Section: Machine Learning Models (mentioning)

confidence: 99%
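
The distance-based idea in the statement above can be made concrete by computing the geometric margin of a candidate hyperplane, that is, the distance from the boundary to the closest sample. A linear SVM looks for the separating hyperplane that makes this quantity as large as possible. The sketch below only evaluates two hand-picked hyperplanes on four made-up points. It does not train an SVM, and the function name and data are hypothetical.

```python
import math


def geometric_margin(points, labels, w, b):
    """Smallest signed distance from any sample to the hyperplane w.x + b = 0.

    Labels are +1 or -1. A positive result means every sample lies on its correct side.
    """
    norm = math.sqrt(sum(wj * wj for wj in w))
    distances = [
        label * (sum(wj * xj for wj, xj in zip(w, x)) + b) / norm
        for x, label in zip(points, labels)
    ]
    return min(distances)


points = [(2.0, 2.0), (3.0, 3.0), (0.0, 0.0), (-1.0, 0.0)]
labels = [1, 1, -1, -1]

# Two candidate boundaries; an SVM would prefer the one with the larger margin.
margin_a = geometric_margin(points, labels, w=(1.0, 1.0), b=-2.0)
margin_b = geometric_margin(points, labels, w=(1.0, 0.0), b=-1.5)
print(margin_a, margin_b)
```

The samples that attain the minimum distance are the "most extreme" ones the statement refers to, usually called support vectors.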

“…Overall, there is a need for benchmarking in order to select the model best suited to the data, and for post-GWAS prioritization the optimal model currently varies across diseases without a one-size-fits-all winner. An optimal model also hinges on data size and quality for reliability and performance, with studies varying in data size and choice of features, from using hundreds of selected features (Isakov et al, 2017) to others exploring tens of thousands (Deo et al, 2014). Further in silico methods need to address these aspects of ML, the lack of functionally validated associated genes at the disposal of ML, and how features are used in order to build a model tailored to post-GWAS prioritization.…”

Section: Machine Learning Models (mentioning)

confidence: 99%

Reaching the End-Game for GWAS: Machine Learning Approaches for the Prioritization of Complex Disease Loci

et al. 2020

“…We noticed that some of the non-stricturing CD patients also had high circulating elafin levels, leading to moderate accuracy when elafin alone was used in identifying stricturing CD patients. Elafin alone is not enough to indicate intestinal strictures accurately because the complexity of many clinical characteristics of the patients has not been considered (Isakov, Dotan et al, 2017; Waljee, Lipson et al, 2017).…”

Section: Discussion (mentioning)

confidence: 99%

High circulating elafin levels are associated with Crohn’s disease-associated intestinal strictures

Wang, Ortiz, Fontenot et al. 2019

Preprint

Abstract

Objective: Nearly 33% of Crohn’s disease (CD) patients develop intestinal strictures. Antimicrobial peptide or protein expression is associated with disease activity in inflammatory bowel disease (IBD) patients. Circulating blood cells and intestine of IBD patients have abnormal expression of elafin, a human elastase-specific protease inhibitor and antimicrobial peptide. However, the association between elafin and CD-associated intestinal stricture is unknown. We hypothesize the elafin expression in stricturing CD patients is abnormal. We determined the expression of elafin in blood, intestine, and mesenteric fat in IBD patients.

Methods: Human colonic and mesenteric fat tissues and serum samples were collected from the Cedars-Sinai Medical Center and UCLA, respectively.

Results: High serum elafin levels were associated with a significantly elevated risk of intestinal stricture in CD patients. Machine learning algorithm using serum elafin levels and clinical data identified stricturing CD patients with high accuracy. Serum elafin levels had weak positive correlation with clinical disease activity (Partial Mayo Score and Harvey Bradshaw Index) in IBD patients. Ulcerative colitis (UC) patients had high serum elafin levels, but the increase was not associated with endoscopic Mayo score. Colonic elafin mRNA and protein expression were not associated with clinical disease activity in IBD patients, while stricturing CD patients had low colonic elafin expression. Mesenteric fat in stricturing CD patients had significantly increased elafin mRNA expression, which may contribute to high circulating elafin level.

Conclusion: High serum elafin levels and adipose elafin expression are associated with intestinal strictures, which may help identify intestinal strictures in CD patients.

Inflammatory bowel disease genomics, transcriptomics, proteomics and metagenomics meet artificial intelligence

Cannarozzi, Latiano, Massimino et al. 2024

UEG Journal

Various extrinsic and intrinsic factors such as drug exposures, antibiotic treatments, smoking, lifestyle, genetics, immune responses, and the gut microbiome characterize ulcerative colitis and Crohn's disease, collectively called inflammatory bowel disease (IBD). All these factors contribute to the complexity and heterogeneity of the disease etiology and pathogenesis leading to major challenges for the scientific community in improving management, medical treatments, genetic risk, and exposome impact. Understanding the interaction(s) among these factors and their effects on the immune system in IBD patients has prompted advances in multi‐omics research, the development of new tools as part of system biology, and more recently, artificial intelligence (AI) approaches. These innovative approaches, supported by the availability of big data and large volumes of digital medical datasets, hold promise in better understanding the natural histories, predictors of disease development, severity, complications and treatment outcomes in complex diseases, providing decision support to doctors, and promising to bring us closer to the realization of the “precision medicine” paradigm. This review aims to provide an overview of current IBD omics based on both individual (genomics, transcriptomics, proteomics, metagenomics) and multi‐omics levels, highlighting how AI can facilitate the integration of heterogeneous data to summarize our current understanding of the disease and to identify current gaps in knowledge to inform upcoming research in this field.
