2019
DOI: 10.1002/cem.3120
|View full text |Cite
|
Sign up to set email alerts
|

SO‐CovSel: A novel method for variable selection in a multiblock framework

Abstract: With the development of technology and the relatively higher availability of new instrumentations, having multiblock data sets (eg, a set of samples analyzed by different analytical techniques) is becoming more and more common and, as a consequence, how to handle this kind of outcomes is a widely discussed topic. In such a context, where the number of involved variables is relatively high, selecting the most significant features is obviously relevant.For this reason, the possibility of joining a multiblock reg… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
43
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
8
1

Relationship

4
5

Authors

Journals

citations
Cited by 44 publications
(43 citation statements)
references
References 23 publications
(31 reference statements)
0
43
0
Order By: Relevance
“…After import into MatLab, serum concentrations of inflammatory and metabolic markers and the abundance of gut microbial OTUs were organized into three matrices (Table 2), to be further processed through a multi-block approach. Given its ability to provide accurate predictions and, at the same time, to identify a parsimonious number of relevant variables (putative markers), the analysis was carried out through the recently developed SO-CovSel algorithm [32]. 1-methylhistidine, 3-methylhistidine, 4-hydroxyproline, α-aminobutyric acid, β-alanine, β-aminobutyric acid, γ-aminobutyric acid, alanine, aminoadipic acid, anserine, arginine, asparagine, aspartic acid, carnosine, citrulline, cystathionine, cystine, ethanolamine, glutamic acid, glycine, histidine, isoleucine, leucine, lysine, methionine, ornithine, phenylalanine, phosphoethanolamine, phosphoserine, proline, sarcosine, serine, taurine, threonine, tryptophan, tyrosine, valine…”
Section: Discussionmentioning
confidence: 99%
“…After import into MatLab, serum concentrations of inflammatory and metabolic markers and the abundance of gut microbial OTUs were organized into three matrices (Table 2), to be further processed through a multi-block approach. Given its ability to provide accurate predictions and, at the same time, to identify a parsimonious number of relevant variables (putative markers), the analysis was carried out through the recently developed SO-CovSel algorithm [32]. 1-methylhistidine, 3-methylhistidine, 4-hydroxyproline, α-aminobutyric acid, β-alanine, β-aminobutyric acid, γ-aminobutyric acid, alanine, aminoadipic acid, anserine, arginine, asparagine, aspartic acid, carnosine, citrulline, cystathionine, cystine, ethanolamine, glutamic acid, glycine, histidine, isoleucine, leucine, lysine, methionine, ornithine, phenylalanine, phosphoethanolamine, phosphoserine, proline, sarcosine, serine, taurine, threonine, tryptophan, tyrosine, valine…”
Section: Discussionmentioning
confidence: 99%
“…In order to identify relationships among different sets of data and to identify putative markers of PF&S, a recently proposed multi-block classification approach, called SO-CovSel–LDA (Biancolillo et al, 2020 ), was adopted. SO-CovSel–LDA is a highly efficient multi-block classification strategy, which combines a very parsimonious variable selection algorithm to be applied on each individual block (CovSel) (Roger et al, 2011 ) with the sequential inclusion of data matrices, after orthogonalization with respect to the previously selected variables.…”
Section: Methodsmentioning
confidence: 99%
“…In the present preliminary study, we took advantage of the well-characterized cohort of older adults recruited in the “BIOmarkers associated with Sarcopenia and Physical frailty in EldeRly pErsons” (BIOSPHERE) study (Calvani et al, 2018b , c ; Marzetti et al, 2019 ; Picca et al, 2019a , 2020a ) to simultaneously analyze biomediators pertaining to three different domains: inflammation, amino acid metabolism, and mitochondrial quality control (MQC). The availability of systemic inflammatory and metabolic data from this cohort (Calvani et al, 2018b ; Marzetti et al, 2019 ) and their complementation with the analysis of circulating MDVs (Picca et al, 2020a ) provided a composite dataset to explore the relationship among systemic inflammation, metabolic characteristics, and MDV trafficking in PF&S. Data analysis was performed through sequential and orthogonalized covariance selection coupled with linear discriminant analysis (SO-CovSel–LDA), an innovative analytical strategy that is particularly suited for dealing with multi-block datasets (i.e., experimental settings in which variables are assayed using different platforms and/or at different time points) (Biancolillo et al, 2020 ). SO-CovSel–LDA enabled selecting the variables of interest for PF&S from a large number of highly correlated candidate biomarkers.…”
Section: Introductionmentioning
confidence: 99%
“…PLS-DA offers the advantage of processing datasets containing a high number of variables even if they are highly correlated with one another. The analysis of the whole dataset, which encompasses multi-block data, will be performed through the recently developed sequential and orthogonalized covariance selection (SO-CovSel) [48]. The method, which allows a highly parsimonious variable selection, was used by our group for the identification of a gut microbial, inflammatory and metabolic fingerprint in older adults with physical frailty & sarcopenia [41].…”
Section: Discussionmentioning
confidence: 99%