2022
DOI: 10.1038/s41593-022-01059-9
|View full text |Cite
|
Sign up to set email alerts
|

Meta-matching as a simple framework to translate phenotypic predictive models from big to small data

Abstract: There is significant interest in using brain imaging data to predict non-brain-imaging phenotypes in individual participants. However, most prediction studies are underpowered, relying on less than a few hundred participants, leading to low reliability and inflated prediction performance. Yet, small sample sizes are unavoidable when studying clinical populations or addressing focused neuroscience questions. Here, we propose a simple framework -"meta-matching" -to translate predictive models from large-scale da… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

3
57
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 48 publications
(66 citation statements)
references
References 72 publications
3
57
0
Order By: Relevance
“…However, most of these studies have utilized the same datasets, primarily the widely used Human Connectome Project (HCP) database [66], with a medium sample size of fewer than 1500 participants. Cognitive prediction studies that use the UK Biobank database have a lot more participants than HCP, but as far as the authors know, only a few studies with large fMRI datasets have been done so far [22], [56], [57]. This makes extrapolating the findings of low sample size studies to larger sample sizes challenging [5].…”
Section: Discussionmentioning
confidence: 99%
“…However, most of these studies have utilized the same datasets, primarily the widely used Human Connectome Project (HCP) database [66], with a medium sample size of fewer than 1500 participants. Cognitive prediction studies that use the UK Biobank database have a lot more participants than HCP, but as far as the authors know, only a few studies with large fMRI datasets have been done so far [22], [56], [57]. This makes extrapolating the findings of low sample size studies to larger sample sizes challenging [5].…”
Section: Discussionmentioning
confidence: 99%
“…As it is not needed for the behavioral variables to be the same to still arrive at similar summary dimensions, datasets with very different behavioral assays can be combined to train predictive models, if one is willing to go one step up in granularity and predict these broader behavioral dimensions. A similar approach to this was already demonstrated by He and colleagues 46 , who leveraged the correlations between behavioral variables to improve the predictive accuracy of predictive models using the functional connectome. There are several avenues to build on these results.…”
Section: Discussionmentioning
confidence: 77%
“…Even for balanced datasets, trained models may still be biased due to unobserved confounders, e.g., severity of the disease or genetic factors. Recent studies have therefore argued for training models on extensive multisource neuroimaging datasets ( 24 ). Our results agree with such motivations—large-scale cohorts of multisource data can enable training robust and unbiased models and also enable thorough evaluation.…”
Section: Discussionmentioning
confidence: 99%