2019
DOI: 10.1515/sagmb-2018-0059
|View full text |Cite
|
Sign up to set email alerts
|

Determining the number of components in PLS regression on incomplete data set

Abstract: Partial least squares regression – or PLS regression – is a multivariate method in which the model parameters are estimated using either the SIMPLS or NIPALS algorithm. PLS regression has been extensively used in applied research because of its effectiveness in analyzing relationships between an outcome and one or several components. Note that the NIPALS algorithm can provide estimates parameters on incomplete data. The selection of the number of components used to build a representative model in PLS regressio… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0
2

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 27 publications
(17 citation statements)
references
References 51 publications
0
13
0
2
Order By: Relevance
“…Leave-one-out crossvalidation was used to calculate the predictive power of the model due to the small number of datapoints and large number of variables evaluated. The number of model components was determined based on the Q 2 value of the cross-validation, such that additional components were only included if significant improvements to the model occurred, which were identified by increases in Q 2 of at least 0.0975 (28,29) . The variables were sorted by variable influence on projection (VIP) and the model was run iteratively including additional variables until the Q 2 score no longer improved.…”
Section: Discussionmentioning
confidence: 99%
“…Leave-one-out crossvalidation was used to calculate the predictive power of the model due to the small number of datapoints and large number of variables evaluated. The number of model components was determined based on the Q 2 value of the cross-validation, such that additional components were only included if significant improvements to the model occurred, which were identified by increases in Q 2 of at least 0.0975 (28,29) . The variables were sorted by variable influence on projection (VIP) and the model was run iteratively including additional variables until the Q 2 score no longer improved.…”
Section: Discussionmentioning
confidence: 99%
“…Participants' scores on the principal components defining the constructs were subjected to Orthogonal Projection to Latent Structures analysis (OPLS) (Trygg and Wold, 2002) for analysis of the relative importance of motivational and situational variables for predicting student achievement (Research question 1) and the relative importance of situational variables for predicting Students' motivational characteristics (Research question 2). The analyses were performed in the SIMCA P+ software (Umetrics, 2013), using the modified non-linear iterative partial least squares regression algorithm (NIPALS; Wold, 1975;Nengsih et al, 2019) for imputing missing data.…”
Section: Discussionmentioning
confidence: 99%
“…In addition to scans of the entire core, four individual aggregates (4-8 mm) of each soil were also scanned with X-ray CT (80 kv, 75 µA, 1 s exposure time, no filter, 2400 projections, two frames per projection), reconstructed in 8 bit at a voxel resolution of 5 µm, filtered with a 2D non-local means filter, and segmented into pores and background with the Otsu thresholding method (Otsu, 1975). The largest cuboid fully inscribed in an aggregate was cut and used for subsequent diffusion modelling as described below.…”
Section: Microstructure Analysismentioning
confidence: 99%