2021
DOI: 10.1016/j.pscychresns.2021.111313
|View full text |Cite
|
Sign up to set email alerts
|

Biased accuracy in multisite machine-learning studies due to incomplete removal of the effects of the site

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 14 publications
(16 citation statements)
references
References 16 publications
0
14
0
Order By: Relevance
“…harvard.edu/fswiki/CorticalParcellation, www.frontiersin.org/articles/ 10.3389/fnins.2012.00171/full#h12 and https://freesurfer.net/fswiki/ SubcorticalSegmentation). For UKB, the MRI data were residualised with respect to scanning site (Alfaro-Almagro et al, 2021;Solanes et al, 2021) using linear models. To remove poor-quality data likely due to subject motion, UKB participants with Euler numbers (Rosen et al, 2018) of ≥3 SDs from the mean were identified and excluded (N = 778; de Lange, Barth, et al, 2020).…”
Section: Mri Data Acquisition and Processingmentioning
confidence: 99%
See 1 more Smart Citation
“…harvard.edu/fswiki/CorticalParcellation, www.frontiersin.org/articles/ 10.3389/fnins.2012.00171/full#h12 and https://freesurfer.net/fswiki/ SubcorticalSegmentation). For UKB, the MRI data were residualised with respect to scanning site (Alfaro-Almagro et al, 2021;Solanes et al, 2021) using linear models. To remove poor-quality data likely due to subject motion, UKB participants with Euler numbers (Rosen et al, 2018) of ≥3 SDs from the mean were identified and excluded (N = 778; de Lange, Barth, et al, 2020).…”
Section: Mri Data Acquisition and Processingmentioning
confidence: 99%
“…When adjusting for age-bias using fit coefficients derived from a training set to correct the predictions in independent test sets, the results were highly comparable (Table4). To check for potential scanning site effects(Alfaro- Almagro et al, 2021;Solanes et al, 2021), we plotted the UKB delta distributions and calculated the correlation between predicted and true age (r) for each site separately. As shown in FigureS2, the results were similar across the three sites.3.2 | Effects of age range and sample sizeThe age-bias corrected r and R 2 values are generally larger for all models, and the corrected values decrease with a narrower age range.In this scenario, the prediction variance is similar across test sets, which is a result of the training set being held constant.…”
mentioning
confidence: 99%
“…A major concern in neuroimaging research is the effect of site on the generalizability of ML models (Dockes et al, 2021; Solanes et al, 2021). Sites may differ in terms of scanner infrastructure, acquisition protocols and neuroimaging feature extraction pipelines as well as sample composition.…”
Section: Discussionmentioning
confidence: 99%
“…Therefore, these potential effects of the site must be very carefully controlled. Ignoring them may yield an inflated accuracy, even when the models do not really predict ( 76 , 82 ).…”
Section: Common Pitfalls and Errorsmentioning
confidence: 99%
“…Here is where a technique called Transfer Learning appears. This approach can extract insights obtained in large general-purpose datasets and use that information to improve small dataset model creation ( 82 ). This technique has already been tested to improve Alzheimer’s disease classification ( 92 ).…”
Section: Challenges and Latest Advancementsmentioning
confidence: 99%