2020
DOI: 10.1007/s11306-020-1640-0
|View full text |Cite
|
Sign up to set email alerts
|

Migrating from partial least squares discriminant analysis to artificial neural networks: a comparison of functionally equivalent visualisation and feature contribution tools using jupyter notebooks

Abstract: Introduction Metabolomics data is commonly modelled multivariately using partial least squares discriminant analysis (PLS-DA). Its success is primarily due to ease of interpretation, through projection to latent structures, and transparent assessment of feature importance using regression coefficients and Variable Importance in Projection scores. In recent years several non-linear machine learning (ML) methods have grown in popularity but with limited uptake essentially due to convoluted optimisation and inter… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 36 publications
(19 citation statements)
references
References 47 publications
0
19
0
Order By: Relevance
“…Next, we aimed to assess the robustness of the metabolites selected by the ML model. We used bootstrap resampling of our training dataset to construct confidence intervals for the variable importance of each of the 707 metabolites profiled (Mendez et al, 2020). The analysis led to the identification of 25 metabolites that significantly contributed to the model's fit.…”
Section: Methodsmentioning
confidence: 99%
“…Next, we aimed to assess the robustness of the metabolites selected by the ML model. We used bootstrap resampling of our training dataset to construct confidence intervals for the variable importance of each of the 707 metabolites profiled (Mendez et al, 2020). The analysis led to the identification of 25 metabolites that significantly contributed to the model's fit.…”
Section: Methodsmentioning
confidence: 99%
“…We used bootstrap resampling of our training dataset to construct confidence intervals for the variable importance of each of the 707 metabolites we profiled. 35 The analysis led to the identification of 22 predictor metabolites that significantly contributed to the model's fit. The structural identities of these metabolites were rigorously confirmed (see method details).…”
Section: Predictive Model Of Covid-19 Disease Severitymentioning
confidence: 99%
“…In particular, Witten and Tibshirani [43] developed a supervised sparse canonical correlation model in order to find significant linear combinations between copy number and gene expression data. For an extensive reading on the extended family of PCA and PLS methods we refer the reader to Mishra et al [28], O'Shea and Misra [35], Gromski et al [94], Mendez et al [95].…”
Section: Study Focusmentioning
confidence: 99%